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ABSTRACT 


A number of models to describe the time pattern of the 
utilization of crude oil in the continental United States (excluding 
Alaska) were proposed in this study. Whenever possible, boundary 
conditions were also stipulated, which were intended to reflect the 
characteristics of growth, and eventual decline, in the rate of 
utilization of oil. Only two of the original twenty-two models 
proposed, (the generalized logistic, and a model involving the 
pattern of percent recovery) gave a reasonable estimate of the 
ultimate recoverable resource base, and were not rejected because 
of the tests imposed on the models and on their error terms, or 
residuals. While the residuals were not entirely random, the 
model with the minimum sum of squares for error, and which was 
selected as a logical candidate to project annual production was 
the generalized logistic curve. This model implied the ultimate 
resource of recoverable oil in the United States would be over 
300 billion barrels, with annual production reaching a maximum 
in the year 1985. The upper limit of recoverable oil would 
probably be less than 400 billion barrels, because this was the 


figure estimated for the ultimate discoveries of oil in place. 
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CHAPTER 1 


INTRODUCTION 


The purpose of this study is: 


1. to leam what factors relate to the long term availability and 
utilization of oil in the United States, 

2. to study and develop mathematical models for use in long term 
projections of the rate of discovery and utilization of oil, 

3. to study any relationships which may exist between measures 
of the availability and utilization of oil, and 

4. to apply the models in an attempt to predict future availability 


and utilization of oil. 


This study was thus an attempt to provide an overall picture of the 
dynamics of the oil industry, using any long term measures that were available 
for the period 1859 to 1968. 

The United States (excluding Alaska) was chosen as the region for 
investigation, because of the amount and quality of data available. More than 
likely, the ultimate resource of oil will eventually be discovered and consumed 


in that region. This possibility is shown graphically in Figure 1.1.1. 
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Knowledge of either the discovery or the consumption process might 
allow prediction of the ultimate extent of the resource of oil, even though one 
would be based on estimates of the rescurce available, and the other on 
consumption by actual production of the resource, 

The objective was to study some of the relationships between measures 
of the discovery and utilization of oil, and to incorporate these measures into 
a model which would predict the remaining recoverable resource of oil. The 
model developed should have as few a priori assumptions as necessary, and 
would make use of only the most reliable data, 

The difficulties of relating financial factors to oil industry activities, 
and allocating costs to the specific products produced, has been discussed 
elsewhere.” Thus, this study was concerned with factors in which financial 
measures were only implicit in the actual functions used, Similarly, functions 
in which measures were related independently of time were also of interest. 

For practical reasons, the data used pertained to crude oil - the 
main petroleum product obtained by the oil industry. Also, only a portion of the 
data available has been employed, and not all interrelationships have been 
analyzed, To do an exhaustive study of this complex industry would require many 
years. In this study, some relationships between factors have been expressed 
in graphical form, The illustrations are meant to be suggestive, but not proof 


of relationships. 
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CHAPTER II 


SOME FACTORS RELATING TO THE DISCOVERY 
AND UTILIZATION PROCESS 


SECTION ONE - STRUCTURE OF THE OIL INDUSTRY 


Much of what happens in the oil industry is the outcome of four 
fundamental characteristics of oil. It is difficult to find; it is not found 
where it is mainly wanted; it is an inflammable liguid, and it has special 
properties. The risk involved in finding the oil in the first place, plus having 
to cope with undesirable location in many cases, necessitates large capital 
investments. Additional capital is required for transportation and refining 
facilities. Finally, for major firms to insure an adequate return on all capital 
that is invested, marketing facilities are necessary to sell the spectrum of 
products derived from the oil. Therefore, fully integrated oil companies are 
usually very large in terms of capital investment. 

The continued growth of the freeworld petroleum industry involved 
about $11.2 billion yearly for capital expenditure during the early 1960's, and 
is expected to rise to about $20 billion yearly by 1975 - giving a total of about 
$266 billion dollars during the 1960 - 75 period. Of this, 60% is to be allocated 
to the replacement of existing capacity and 49% for growth. It is not practical 
for the petroleum industry to draw its funds from the stock markets, because the 
40% needed for growth expenditure in oil alone represents about half of all 
funds raised on capital markets. In any case, the petroleum industry is mostly 
self-financed, having drawn only 6% of its funds from capital markets during 
the ten years preceding 1963.7 4° 

It is apparent that where the oil industry has grown to prominence you 


find bigness and vertical integration. The largest international oil companies 
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are concerned with maintaining their own positions in the world oil business. They 
look to permanence; they are in business to stay, and ideally to stay with a big 
share. 

The oil companies have, in general, remained oil companies, their 
profits being either distributed or ploughed back into one aspect or another of 
the oil business. Few have put their profits into other enterprises, thus facilitating 
expansion by vertical integration. Eight major international oil companies are 
presently responsible for over 80% of crude oil production, 71% of refining capacity, 
35% of tanker ownership and approximately 70% of the distribution and marketing 
of oil products.* 

In 1965, this major international group of companies produced 4, 761,790, 000 
barrels of oil in countries with net exports. If one assumes an average profit of 
50 to 65 cents per barrel on this oil, operations in these exporting countries 
accounted for 65% to 84% of the group's total profits of 3,665 million dollars 
during 1965.°,° 

Outside the United States, Mexico, and Russia, the operations of 
these major oil companies are combined through various intercompany holdings 
in subsidiary and affiliated companies. These holdings constitute partnerships in 
various areas of the world. Each of these companies has pyramids of subsidiary 
and affiliated companies in which ownership obviously provides opportunity, and 
even necessity, for joint action. Cooperation is achieved through such indirect 
means as interlocking directorates, joint ownerships of affiliates, intercompany 
crude purchase contracts, unitized production arrangements and marketing agree- 
ments. 

Governments have, increasingly, begun to participate directly in oil 
operations. Countries in which government participation in oil is a complete 
monopoly include all the communist world, as well as Mexico, Brazil, Chile, and 
Taiwan. However, many other countries hold monopolies in certain sectors of 
their economy through various state enterprises. Thus, Spain monopolizes marketing 
while it admits private-enterprise companies to exploration, and Italy holds 


a virtual monopoly over natural gas production through ENI. In all, a total of 
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68 national oi] companies were operated by various governments around the 
world at the beginning of 1965. Governments have elected to cooperate with 
private-enterprise capital in arrangements which are mutually beneficial. 

Governments have also elected to form partnerships among themselves. 
An example of this was the formation of Organization of Petroleum Exporting 
Countries (OPEC) in 1961. This organization is dedicated to maintenance of the 
level of posted prices. Crude oil prices are of major importance to these govern- 
ments, since their tax and other revenues from oil depend on price.’ 

Thus, at each level of the industry, there are a number of activities 
carried out, each of which probably relates in some way to the rate of utilization 
of domestic oil resources for any region. The fact that a few major oil companies 
have a significant share of production and market may cause interaction in the 
activities between regions. In any case, shut-in production for one region has 
always been compensated by accelerated production elsewhere. 

An ever quickening pace, however, has generally characterized the 
oil industry, regardless of reason. Per capita demand for petroleum has increased 
in major consuming countries, increase in population compounds the increase in 
demand. More and more producing areas thus have become established to meet 
the need. Initial production, while low, usually increases to some maximum for 
the area and then declines after the area is fully explored and developed. This 
has been the case in a number of European countries, and for some areas in the 
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mcURE 2.1.1 


ASSETS OF ALL MAJOR OIL COMPANIES COMPARED 


WITi4 CANADIAN INDUSTRIAL FIRMS 


1966 
OTHER INTEGRATED ASSETS 
OIL COMPANIES (Millions of Dollars) 


13,5 13,750 


13,250 - 13,500 | 

“7 18, 000 ~ 13,250 
~ 712,750 ~ 13,000 | 
12,500 - 12,750 | 

~~ 12,250 - 12,500 | 
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‘(1,750 = 12,000 | 
11,500 - 11,750 || 
11,250 - 11, 500 | 
11,000 = 11,250 | 
| 7 10, 500 - 10,750 | 
10,250 - 10,500 
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8,000 - 8,250 |i | 
7,750 - 8, 000 ||| 


7,500 - 7,750 | 
| 7,250 - 7,500 || 


| 7,000 - 7,250|/ Il} 


6,750 - 7,000 |! 


6,500 - 6,750 


6,250 - 6,400 | 
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FIGURE 2. 1. 2 


"CARTEL' SHARE OF PRODUCTION AND/OR OWNERSHIP OF EXPORTED OIL 
1965 
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SECTION TWO = OIL INDUSTRY ACTIVITIES 


In the integrated oil industry, there have traditionally been six distinct 


activities in the utilization of petroleum, as follows: 


l) Exploration: This is the search for pools of oil and/or gas. Exploration 
operations include: aerial surveys, geophysical surveys, geological 
studies, core testing, drilling of test wells (i.e. - wildcat wells), and the 
acquisition of mineral rights or permits or reservations from those holding 
mineral rights. 

2) Development: Development is the drilling and/or bringing into 
production of wells or a field, following exploration activity and the 
discovery of a pool. Development activities include development drilling 
and the nrovision of equipment and facilities necessary for the efficient 
extraction, gathering, and conservation of the petroleum. 

3) Production: Production is the act or process of producing oil or gas. 

The act of production usually involves royalty payments to the owner 
of the mineral rights. 

4) Transportation: Transportation is the act or process of transporting 
petroleum products such as crude oil, natural gas, etc., to or within a 
rnarket area. The distribution of refined products is not generally 
considered part of the transportation activity, even though some areas 
do have pipelines for refined products, and some are shipped by boat. 
However, the bulk of refining is increasingly carried out near a market area. 
Transportation commonly involves crossing of political boundaries, so that 
in addition to direct transportation costs, import duties, quotas, and other 
restrictions may be associated with transportation. 

5) Refining: Refining is the act or process by which the physical or 
chemical characteristics of petroleum or petroleum products are changed, 
exclusive of the operation of placing petroleum in settling tanks to remove 
sediment and water, or passing petroleum through separators to remove gas. 


The refining process is employed to upgrade oil into a vast number of more 
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valuable products; nearly all oil is refined prior to eventual consumption. 
6) Marketing: Marketing is the final handling, selling and delivery of 

petroleum products which have been refined and adapted to the use of 

the consumer. | 

Some of the steps of separating and treating petroleum products into their 


finished form have been illustrated in Figure 2.2.1. 


REDE REIN Ce: 


1. Gasoline Marketing Enquiry Committee, Gasoline Marketing in the Context 
of the Oil Industry, Queen's Printer, Edmonton, Alberta, 1968, p. 421. 
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SECTION THREE - SOME FACTORS RELATING TO THE DISCOVERY AND 
UTILIZATION OF OIL. 


The dynamic relationship between some factors (each of which describes in 
part the process of discovery and utilization of oil) has been charted in the simplified 
flow diagram in Figure 2.3.1. 

The initial boom in the industry continues to be manifest, but to a 
diminishing extent, as the resource is depleted. The most accessible sedimentary 
basins are naturally among the first to be explored. In these basins, the largest or most 
accessible fields are more readily found, and thus brought into production first. While 
these larger fields are being gradually depleted, somewhat smaller (and/or more remote) 
fields tend to be discovered; thus, this process is repeated over a long period of time.? 

Improved secondary recovery, and the use of drilled facilities which may 
have been financially depreciated to the write-off stage, tend to extend the life of 
many fields well beyond their original expectancy.? However, the amount of effort 
necessary to produce the entire recoverable resource may increase as the last barrels 
of oil are produced. As time goes on, much of the capital accumulated will either 
be used to find and produce oil in other areas, or be applied to other ventures. 

Towards the end of the life of a field, each new barrel of proved reserve 
would be produced almost as soon as it is found. If current production costs alone 
become greater than the value of the oil, physical production may cease. 

Sporadic efforts to apply new techniques for secondary recovery would draw 
out more oil, but at an ever diminishing rate - until that time when all the resource 
was used. The life cycle for other sedimentary basins and for fields within these 
basins would be parallel - unless they were never discovered. 

The strong economic forces, which in the beginning, have activated and 
nourished the growth in production, would no longer be great enough to sustain further 
production. All the accumulated wealth, and economic resources, would be subject 


to some law of diminishing returns, and so would be directed to other ventures. 
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RECEREINCES: 


la J. J. Arps and T. G. Roberts, Economics of Drilling for Cretaceous 
Oil on East Flank of Denver = Julesburg Basin, Bulletin of the American 
Association of Petroleum Geologists, Volume 42, Number Il, November, 


1958; p. 2549 = 2566. 


Ze E. B. Miller, Jr., "Old Oil Fields Never Die", Economics of the 
Petroleum Industry, Volume 3, Gulf Publishing Company, Houston, 
1965, pp. 137 - 150. (A postscript to the title might be "They Just 
Drain Away"). 
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CHAPTER Il 
RESULTS @rePreyLOUSssTUDIES 


SECTION ONE = INTRODUCTION 


Numerous estimates of the ultimate resource of crude oil in particular 
regions have been made. While most have been for the United States, a few 
have included other major producing countries. 

Perhaps the most widely known studies have been carried out by Lewis G. 
Weeks, who was a petroleum geologist for Standard Oil of New Jersey. Weeks 
used all information which was available, such as statistics about production, 
reserves and wells drilled, geophysical readings, along with any other data access- 
ible, asan aid in arriving at an estimate of the volume of sediments and the 
ultimate recoverable resource of oil in all known producing areas of the world. 

Figures 3.1.1 to 3.1.6 are graphical summaries of some estimates which 
have been made of the world resources of fossil fuels. The estimates of world crude 
oil resources were based mainly on the studies by Weeks.* #2 

The results of such studies, which are sometimes based on knowledge of 
a variety of information, are of great interest. Presumably a properly devised 
dynamic model should lead to similar results. To the extent that an adequate 
amount of reliable data is included in the model, a projection should be obtained 
which may in time prove to be fairly valid. 

According to C.L. Moore, "The projected patterns of petroleum industry 
activities .... are the conceptual patterns of these future activities. It is certain 
that these projected patterns do not accurately portray such future activities. 
Rather, they provide a reference base or guide from which to judge the adequacy 
with which such projected activities will meet the projected requirements for 
domestic petroloum, ..eee © 

" The signficance and value of projections of future activities depends on 
their probable reliability. The reliability of any projections of historic patterns 


into the future must be assessed from two completely different approaches -- the 
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conceptual and the statistical. If the concept of continuity is accepted, i.e. 
activities are valid criteria of the future patterns of such activities for a useful 
period of time, then the only question is the reliability of the method used for 
analyzing and projecting the historic patterns. Conversely, if this basic concept 
of continuity is rejected, then all methods for projecting historic patterns are 
useless. Acceptance of the opposite concept implies that the future pattern 
of established activities cannot be predicted. 

"If the concept of continuity in the historic pattern of relevant activities 
in the petroleum industry is accepted, then the reliability of the method used for 
the analysis and projection of such patterns can be appraised by established 
statistical criteria and techniques. "? 

In the process of utilization of the resource, several indications of the 
probable course of events may emerge. Unfortunately, in the early stages of 
utilization, there is sometimes little on which to base any sound projection, and 
while all data are probably relevant, some indicators are probably much more valid 
than others. 

Measures of production, proved reserves, oil in place, and wells drilled, 
form the basis for projections that have been made concerning the ultimate resource 
of crude oil. When the latter two measures alone are used as a basis, problems 
artse in the estimation of the economics, or timing of utilization. On the other 
hand, when production, or proved reserves are used as a basis, the projections 
obtained may differ unrealistically from geological estimates of the amount of oil 
in the ground. No study has related the measures to each other in an analytically 
consistent model with meaningful cause and effect relationships. 

The geological studies tend to ignore economic and political factors. And 
because of the relatively short-term perspective and fairly urgent nature of 
activities associated with proving reserves and producing them, projections based 
strictly on production and proved reserve data may lead to grossly differing results, - 


according to the kind of model used. 
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In order fo commence an analysis of the discovery, production 
and utilization of oil, most studies have made several simplifying assumptions. 
Interactions between regions were usually not included in the analysis, and it 
was assumed that in a particular region, trends in discoveries and production 
followed a unique pattern which could be determined for that area. 

In general, it was assumed that the ultimate resource of the i region, 
R., could be determined from a projection of the time trend in discoveries, 
production, and/or other factors, x0, so that as t > ™, 


R. was implied by some function of (x. Xeot Xian ve Xin t) 


where j = the number of factors for which historical data were available. 

If, for some factor, reliable data were available encompassing a fairly 
long period of time, it was sometimes possible to use these data in a special case 
of the equation above. Some factors which met that requirement are discussed 
below, along with a summary of studies dealing with any projections of the data 


to the ultimate recoverable resource. 
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SOURCE: See Reference 6. 
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SECTION TWO - PROJECTIONS CF CUMULATIVE PRODUCTION 


In a study for the United States Department of the Interior, C. L. Moore 
used the Gompertz curve to project cumulative production, y. Cumulative produc- 


tion was therefore assumed to be described by the equation 


eae (357.1) 


where c and d were constants, with In| Inc] establishing the time 


origin and R the upper asymptote of y.*’* 


The differential equation from which the Gompertz equation can be 


derived is 
aye 
p= Ya-ky nf), 
with =- Ind, a rate parameter. (37252) 


When plotted against time, this growth curve has an early inflection 
point, anda long tail to the right, with ¥ approaching zero as y approaches R. 
In 1966, Moore used data for oil production from 1920 - 1965, and estimated the 
ultimate resource of crude oil in the United States to be 434 billion barrels. 
(Actually, Moore projected rates only to 1980; R was one of the parameters determined 
in this projection.) * 

Moore's method of fitting the Gompertz curve to the data was to plot A 
log y against log y, where Alog y as defined as the difference be tween values 
of log y for successive years. The constants for Equation 3.2.2 were then obtained 
from the regression of Alog y on log y.* 

The result obtained using production data was consistent with estimates 
of the cumulative discoveries of crude oil in place in the United States. These 
were estimated at 330 billion barrels as of the end of 1960, and 38] billion barrels 


at the end of 1966.°’° Projections of estimates of crude oil in place are dealt 


with later in this chapter. 
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Moore pointed out that cumulative discoveries are calculated from: 

a) estimates of proved reserves, and 

b) data on actual production. 

He, and others, considered the facts about actual production superior to 
the figures for reserves in any derivation of resource estimates, 7° 

Moore states that "The primary criterion (for a measure of the historic 
pattern of activities in the oil industry) is the adequacy and accuracy of the 
basic statistical data used. In recent years, a few significant economic studies 
by competent analysts have been published in which the many variables affecting 
domestic petroleum supply have been related in mathematical form and by 
statistical techniques. As the authors of these papers have stated, their 
econometric studies have been severely handicapped by the lack of basic data 
both in quantity and quality" 

Annual data on domestic petroleum production in the United States were 
available for any period from 1859 to the present; data on proved reserves 
exists only from 1920 onward. 

Similar information concerning activities such as expenditures on explor- 
ation is not available, and even if it were, there would be a problem in determining 
the proportion of expenditures for oil and the proportion of expenditures for gas. 

Also, reliable information on imports and exports is difficult to assemble, 
because much of the crude oil is partially (or even completely) processed into 
a variety of products, any of which may be exported (or imported) in substantial 
quantities. Some indication of this flow of products from well head to consumer 


is shown in Figure 2. 2. 1 . 
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SEC IIOMPTAREE = PROJECTIONS OF CUMULATIVE*DISC OV ERIES 


In a study for the National Academy of Sciences, M. King Hubbert 
assumed that the logistic equation described the growth equation of cumulative 


discoveries, q, so that 


-at (35,091) 
e 


where 4, was the asymptotic value to which q would tend as time, t, became 

large, while a and b were positive constants, with b determining the time origin.’ 

By definition, q is the sum of proved reserves, r, and cumulative production, y. 
The differential equation from which the logistic equation can be derived 


with lower asymptote zero is 


“A= = kq (400-4) (3.3.2) 


where k = 4/4 is a positive constant, or rate parameter.” 

The method used by Hubbert to determine the constants in Equation 3.3.] 
was to plot ((qoo/q) - |) as a function of time on semilogarithmic paper using 
various assumed values for q%°. The value of q which resulted in the straightest 
line gave the solution to Equation 3.3.1. The constants were then obtained from 
the equation for this line.~ 

The ultimate resource of crude oil in the United States was estimat ed 
by Hubbert to be 170 = 175 billion barrels - rather conservative figures in 
cornparison to the resource base estimated by others. The result implied there were 
only 76 billion barrels to be found in all future discoveries .* 

Hubbert assumed that any proved reserves found would be produced in 
10.5 years, as had been the case for some time in the United States. Thus, 
cumulative production could be determined by using a time lag of 10.5 years 


in the logistic equation.” 
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An economist for Standard Oil of New Jersey, J. M. Ryan, criticized 
the method used by Hubbert on the basis that various values qg could be 
selected according to the time span of data used. For example, there seemed to 
be a change in the trend line for ((aa9/q) - 1) during 1930 to 1940.° 

Ryan applied Hubbert's method to the 186l to 196! data, which were 
available for production in the United States. There appeared to be a change 
in trend line after 1881, and again in 1931. Using production data from 1881 to 
1961, the value for qao which produced the straightest trend line was 100 billion 
barrels. Ryan therefore contended that an allowance for changes in the trend 
of discovery and production had not been incorporated into the method and that 
a wide range of figures could be chosen for qoo ; therefore the technique used by 
Hubbert could not be established as valid.” 

Changes in trend for cumulative discoveries and cumulative production 
in the United States are also apparent in Figure 2.1.4. For the period 1930 
to 1940, there is no apparent change in the trend for world production. The 
United States case could probably be explained by the discovery of large amounts 
of crude oil in place in 1930, and again in 1936 to 1938.° The trend change 
in 1880 could only be explained by some very early inflection point, because 
continued (logarithmic) growth at the 1860 to 1880 rates would lead to very 
high estimates of qo, 

Hubbert's reply to Ryan was that his results were consistent with 
the fact that cumulative discoveries had reached an inflection point in the 
United States between 1940 and 1950, and that proved reserves (which precede 
production) had probably reached their maximum by 1960.°’*° Hubbert 
contended that because proved reserves last an average of 10.5 years, production 
was therefore directly linked to proved reserves, with cumulative discoveries 


reaching a maximum before cumulative production. 
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However, since 1900, while the average life of proved reserves has 
been II.| years, the life of proved reserves in the United States has ranged from 
9 to 15 years.” The relationship between cumulative discoveries and cumulative 
production is therefore not a simple one, and probably depends upon a variety of 
factors. 

It is sometimes possible to obtain cumulative discovery estimates classified 
according to the year of discovery of fields. These estimates are called proved 
discoveries for fields according to their year of discovery.*” 

Proved discoveries are sometimes called ultimate recoverable reserves 
by year of discovery, and in Canada proved discoveries are usually called proved 
ultimate reserves by year of discovery. Proved discoveries may also be classified 
by the geological type of reservoir in which they occur.” °’* 


Cumulative discoveries can be related directly to proved discoveries 


with: 


qi =) u(t), (3.3.3) 


where q(t) = cumulative discoveries estimated at year t for a region (such as the 
United States), 
u(t) 2 proved discoveries for fields which were discovered during year i; 
ie I aleaisieey eieee te 
u(t) = proved discoveries for all fields discovered up to and including year 
N, and 


N = the first year in the series of data reported. 


From Equation 3.3.3, it is obvious that U; (t)must be equal to the 
cumulative production from fields discovered during year i, plus any proved 
reserves remaining in these fields under the recovery conditions which existed as 
of year t. 

With proved reserves estimated for 1964 recovery conditions, Moore 


used the Gompertz equation to project 1920 - 1959 proved discoveries in the 
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United States to an ultimate value of 139 billion barrels. 

This rather low estimate can be explained in part by the fact that proved 
discovery estimates appreciate considerably with time. It can be shown that 
(n +1) years after discovery of a field, the relative appreciation in estimates of 


the proved discoveries can be rather closely approximated by a function of the form 


U{otIau(n) _ b(l=-e-) 


u (n) ol ae (3.3.4) 
t-|-n 

where — y (n) x ui (i), (orc.5) 
in 


u(n) = gross proved discoveries, aggregated for all discovery years 
n years after discovery (sometimes called ultimate proved 
reserves), 

u.(i)= proved discoveries as estimated at the end of the jth year 
of production, 

Wed 0 |e 2h ieatpet=2 > 


andb =A/(atA). 


A and a are positive constants; A represents the appreciation of 
proved discoveries, and a represents the decline of appreciation in proved 
discoveries as proved reserve estimates are updated from year to year.” ° 

The relationship of the constants A and a in Equation 3.3.4 has been 
shown in Figure 3.3.1. The constants can vary considerably according to the 
kind of proved reserve data being used; more conservative proved reserve data 


would lead to a larger value for A. Some typical values are A=1.9 anda= 0.1 


for proved reserve estimates in Alberta during 1948 to 196l. 
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SECTION FOUR = PROJECTIONS OF DISCOVERIES OF OIL IN PLACE 


Estimates of the amount of crude oil originally in place, according to 
year of discovery, are now published annually for the United States and Canada. 
The Canadian estimates are for discovery years from 1947 and onward, and the 
figures for the United States start with discovery year 1920. The cumulative 
total of the original crude oil in place, estimated for each discovery year up 
to year t, is the cumulative discovery of crude oil in place, w(t). This amount 
is revised (usually upward) as more information is obtained about fields previously 
discovered. The relative amount of appreciation is usually greater for those fields 
most recently discovered, and is a reflection of the greater rate of appreciation 
in proved discoveries for a field during the first few years of production.*/* 

In summary, if at year t, v. (t) = the estimated original amount of crude 


oil in place in fields discovered during year i, 


Pa Ne INE INE) poe 


then wit)= 5 v.(t) (3.4.1) 
i=N 

where N was the year production first began, and where w(t) = cumulative 

discovery of crude oil in place, as estimated as of the end of year t. 

In a study made in 1962, Moore adopted a fifteen term formula to 
smooth the variations in v which occurred with time. Moving averages tended 
to blanket the original data, and did not provide an acceptable fit with the 
original daia; the fifteen term formula retained the inherent cyclical 
variations.~ 

Moore then plotted v against w, and assumed that v increased linearly 
with w up to I4l billion barrels (during the period 1860 to 1932) and then 
decreased linearly with w, as w increased to a projected terminal value of 
486 billion barrels. This led to a simplified exponential formula 


062886 n_ 


v(n) = 1700 (e l) (37472) 
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for n = firme in years from 1860 for the realtionship up to 1932; for the 
relationship after 1932 
Oe cuales, 8) (3.4.3) 
Estimates of crude oil in place provide only a basis for calculating 
proved reserves. The next step is to project the anticipated percent recovery 
of crude oil oviginally in place. 
Increases in percent recovery are probably brought about by improved 
technology, and by the increasing necessity to extract as much of the resource 
already discovered, rather than to prospect for more oil. To the extent that 
increases in percent recovery are a direct consequence of improved technology, 
percent recovery might best be described by some growth curve with time as the 
independent variable. The percent recovery would be low and nearly constant during 
the first years of production, but would likely rise asymptotically to some value 
less than 100%. This would likely lead to a fairly complex relationship between 
cumulative discoveries of oil in place and production. 
As outlined in the earlier report by Moore, the second step (after projecting 
oil in place) was to establish the historic pattern of the percent recovery of crude 
oil in place.® 


Moo-e used a modified logistic equation for this purpose. He assumed 


per.= pm. | (3.4.4) 
lS ena, 
where 


p.r. = cumulative percent recovery 


p.m. = maximum ultimate percent recovery 


t = time in years from January |, 1930 

a =a variable derived from the equation 

| | = 

ik a eas ce a ae (3.4.5) 
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Using !5 percent as the lower bound of p.r. (for the years up to 1930), 
Moore tried several values of p.m., and found that a value of 75 percent provided 
the best corretation with actual data for the years 1937 - 1961. This result seemed 
to Moore to be reasonable enough, and so Equation 3.4.5 was used to describe 
future recovery for the United States. At 75% recovery, the ultimate recoverable 
resource of crude oil was 344 billion barrels. © 

The third step was to calculate projected cumulative discoveries from the 


results obtained in steps one and two, since 


Ditise = Git) by definition.” (3.4.6) 
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The fourth, and final step, involved the establishment of the historic 
pattern of proved reserves. From this, the trend in cumulative production could 
be obtained directly. 

Moore tried various curves and formulas to match the pattern of cumulative 
production when plotted (linearly) against cumulative discoveries; assuming the 
following boundary conditions: 

1. When cumulative production was zero, cumulative discoveries were 

also zero. 

2. In the final year of production, the last barrel of proved reserves 
would be produced, so that both cumulative production, and cumulative 
discoveries, would reach 364 billion barrels, (i.e. 75% of the oil 
in place) and no more. 

A catenary curve, which passed through the points (0, 0), (354, 364) and gave 
the best correlation between y and q for the series of years 1930 ~ 1961, was finally 


adopted. The pair of equations for the formula used were 


r=b - /2 a cosh (p/a) (3.4.7) 
and Ap = 366 - 2 (q-b) (3.4.8) 
where a and b were positive constants, 

p =a variable linking the two equations, 

r = proved reserves, 


; . 4 8 
and q = cumulative discoveries. 
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The above three measures of physical production, proved reserves, and 
discoveries of oil in place, have formed the basis of most of the projections using 
time dexendent growth functions. While there are other factors which would be 
relevant to the rate of utilization of crude oil, few measures are published for 
as great a time span as the discoveryand production data. There are even fewer 
measures published for the major areas in which oil has been discovered and produced. 
The various activities of the oil industry can be related directly to data 
on discoveries and production of oil. Inparticular, exploration experience in one 


area may be applied to other (less explored) areas, as discussed in the next Section. 
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SECTION FIVE - ESTIMATES FROM WELLS DRILLED 


Many estimates of the likely benefits from exploration programs based 
on previous successful wildcat wells and discoveries from these wells have likely 
been made. Probably most of these investigations have not bem published because 
they were based partly on a company's own experience and knowledge gained 
from their exploration activities. 

Such studies are not strictly concerned with the overall rate of utilization 
within any political area. They are usually made as an aid in deciding among 
exploration programs which promise an adequate return on the capital invested. 

Furthermore, the use of base data on discoveries (which may not yet have 
been fully appreciated to their ultimate value) from other regions may lead to 
serious errors in long term projections of the ultimate potential of a fairly new 
basin. Therefore, adjustments usually have to be devised to estimate the long term 
economics of production in the area. 

Arps and Roberts postulated that in any sedimentary basin, the estimated 
number of additional pools, AT (of size between the limits ® and (b+ A®) 
in that basin was jointly proportional to the number of additional wildcats drilled, 
Aw, the number of undiscovered pools of that size remaining, Ty - T,and 


the average areal size of such pools, , or 


aD 


ae = Cop (lo ale (3.5.1) 


By putting Equation 3.5.1 in differential equation form, and integrating, Arps 


and Roberts obtained 
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where 

®@ = average areal size of the pools being considered, 

IT. = the number of such pools discovered after a given number of 

wildcats, W, had been drilled in the basin, 

Ty = the original number of such pools before drilling had started, 

Y = the number of wildcats drilled in the basin, and 

¢ = aconstant, depending upon exploration technique. 

It should be noted that Equations 3.5.1 and 3.5.2 are not dependent 
upon time. 


Arps and Roberts classified pools into fifteen groups according to their 
estimated ultimate recovery. The limits for each group were chosen so that each 
interval was twice as large as the interval for the preceding group. The smallest 
group had an estimated average ultimate recovery of 1,700 barrels, and the 
largest group had an estimated average ultimate recovery of 50,750,000 barrels 
of crude oil. 

The average productive area of pools was calculated for each group 


using the relationship 


yp = 530 (6) baa! 24 (3.5.3) 
where wy = estimated average ultimate recovery of pools, 
and © = average areal extent of pools in acres. 


A (smoothed) histogram of the frequency of the 338 pools as classified 
into fifteen groups formed the basis for estimating the effect of drilling additional 
wildcats; Equations 3.5.2 was used to estimate the incremental number of pools 
(for each of the fifteen groups) which would likely be found by drilling a given 
number of wildcat wells. 

Arps and Roberts also analyzed the data on the average number of dry 
holes, N, drilled during development of fields to delineate pools of average 
areal size ® acres, and found that 

Nie, 70 ae ; (35534) 


closely approximated the relationship .* 
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Kaufman provided a probabilistic adaptation to the deterministic model 
proposed by Roberts and Arps. Kaufman assumed that the volumetric size of pools 
within a field (or fields) would be lognormally distributed if all pools were found. 
The bivariate lognormal distribution was assumed to describe the areal extent in 
acres, and average net feet of pay of all pools which might eventually be found 
in the field. 

If the above assumptions were true, then when a number of pools of 
various sizes had been discovered in a field, an estimate of the number and sizes 
of pools remaining to be found could be made. Kaufman postulated that by 
studying successive frequency distributions of pool sizes for several stages during 
the development of a field, some idea of what might be expected if more wells 
were drilled might be obtained. Also, if the true size of the overall field was 
derived by geological estimates and by applying subjective judgements, the 
eventual distribution (which would be asymptotic to the lognormal) might be 
estimated .* 

Kaufman planned to study some of the problems of projection in 
cooperation with Paul Bradley of the University of British Columbia.? Mr. Crabbé 
of the National Energy Board is also studying this problem .* 

Because of the lack of any other means of estimation, several studies 
have employed the results obtained in fairly well explored areas of the world, 
to assess potential resources of oil elsewhere. The authors of these studies have 
almost invariably warned of the gross errors which may be inherent in such 
estimates; the following quotation from a report by T. A. Hendricks clearly 
states some of the problems involved: 

"Geological science has been very successful in classifying parts of the 
earth's crust as favorable, unfavorable, or impossible for the occurrence of oil 
and gas, but the exact location and extent of the occurrences can be determined 
only by drilling. By the same token, only extensive drilling and production 
experience can provide the statistical sample needed to attempt quantitative 
estimates of the producible quantities in the unexplored favorable parts of the 
earth's crust. Moreover, much of the purely geologic information needed for 


intelligent appraisal of unexplored rock can be provided only by drilling. 
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"At present, only a tiny part of the geologically favorable rock in the 
earth's crust has been explored by drilling. The amount of exploratory footage 
drilled in the United States dwarfs the total footage in the rest of the world, but 
even in the United States there is "room" in the geologically favorable rocks for 
more than six times as much exploratory drilling as has been done so far. Until 
a far larger "sample" has been drilled, estimates of undiscovered quantities will 
be subject to major revision".® 

However, Hendricks pointed out that some estimate of the ultimate amount 
of crude oil in place could be obtained if the following factors were known, or 
could be estimated: 

1. The quantity of ofan place found by exploration to date. 

2. The fraction of the potentially productive rocks that have been 

explored to date. 

3. The estimated incidence of oil in the unexplored part of favorable 

rock, as determined by comparison with the explored part.® 
The actual amount of crude oil in the ground would be related to a number of 
factors. With 328 billion barrels of oil in place having been reported as of 1960, 
Hendricks assumed that there must be at least 400 billion barrels in known produc- 
ing fields in the United States, which would eventually be reported through 
extensions and revisions.” 

Hendricks then grouped oil producing states into three categories, 
according to their productiveness. He assumed the incidence of oil in the 
unexplored areas of each category to be: 

i equal to the average of all explored areas, (i.e. the average for 

all states) for the most productive states. 

2. zero for the least productive states. 

3 the average of 1. and 2. for all other states. 

On this basis, Hendricks arrived at an estimate of 1,600 billion barrels of crude 


oil in the ground in the United States (excluding Alaska) .® 


¥ ay? % me ah ni duis 1 he wet 


hlineonas Se: sigalong dete on a 


va 


st : , ; 
ry akieceaale mihleadios 1005's co Abu —. 


- mn ' +4 
Div soli onup! hore venti 16 etait tr ik toa store 
a * =. Salish vojeen @! 
ae : a ptpentitad well Pe tnmitie aide tor'l rhe Leotivienny agitator ear ees 
- - w¢ 


i} beritwide a Blvds adel wt Tis ob ato 


Sd copiridninibhoes chat viata al eae neene bat 
: rend ever tpt esos ey Hower giteiod. oa TENy ote a enl 


‘se Wigi 4 ale ‘hk 


en) a wt 
Vetdesbrat le tong bbabtynt res aytt Wi Be toes ont “ASht Westone gt 


_ 4yog bis col pate A) i ‘| iA { i (ro A banin far ai» ei iba sory 


s vr 


7 ‘i } 
s ‘ih . a i Pen a Re al ey ry 
wn By raalewary eo coer P vil cont! at Tydbuys 167m ony leutan eee 
7 ? 


WRT lp to bevogn) 0846 give wiv ni Tie bo otra antl ha GRE en wih ont 


7 i ‘ ; 7 = 4 
a mavborg radar ca: he mi ov) feng fennel oni tilt hesinys wes elais ot 
eo 
BP ice . ; : 
. > | i Higa even vel be notre] PALE | | vane Binoy sys WA ar a] ipl? be! in! eth ei ibisti'gn 


| : Ve oraptei vey bray sacteae 
: 7 _ 
' ‘ 


z ii ‘a : * an. ¢ f= 7 ‘ 5 4 
pyeetingdiao maWlPetnl eatarte ecto nrot fic CM4 uae Gell etalibont! 7 


_ 
7 
- - . . - Hi j La i] i. - = 

ene ai lie Io sort yi! Sr" me ; of Thee MS Gey rt i ad ae FOE 
if { 


» ‘ 1 PI = 
; eda! yada Hope to aed: ‘bali re 
ae Al - _ ~ 
Wi SGoIovy Fr +l) eee ah é Ny lovapeowl nl ot: invpe ves i 


ae techn cht wierte Ye. oe 


: oy a 
ikainiacev Tt ou banat ri aot ooh omdis 


| a paaig tw? 2 Lijts - if 
100%, 1% oipinites: ok % Asie 


"ames [ dio 


yi a ag 


ime ig 


«45 4 


Assuming that one third of the (five sixths) remaining unexplored area 
would be drilled before exploration became uneconomical and ceased, and further 
assuming that the incidence of oil in the areas to be drilled would be 75% of 
that in the explored areas, Hendricks finally arrived at an estimate of 1,000 billion 
barrels for the ultimate resource of oil in place. ° 

With the above figures for the United States as a basis, Hendricks went on 
to estimate the world oil in place at 10,000 billion barrels, of which 6,200 billion 
barrels would ultimately be recoverable, This estimate was based on a number of 
assumptions about the relative productivity of various areas of the world in comparison 
to the three major categories (according to productivity) established by Hendricks for 
the United States, *° 

Weeks contended that "petroleum resource estimates, ... can only be 
determined by correlating the facts of oil occurrence with geology, as revealed by 
the worldwide exploration experience of the industry". | He points out that the 
incidence of oil varies from an average of Jess than one thousand, to several 
million barrels of oil per cubic mile of sediments. 

According to Weeks, the only basis for resource estimates is that of 
"exploration experience on the widest possible basis, scientifically analyzed, (and) 
supplemented by realistic research", He stated that "extrapolations of production 
curves", and "indiscriminate comparison of areas or volumes of sediments", may 
produce grossly misleading figures. ** 

Weeks suggests that the most valid indication of oil in a particular sedimentary 
forrnation can be obtained from a knowledge of its age and structural history. Seepages, 
other surface evidence of oil, geophysical data, groundwater movement, and results 
from mapping techniques may give some further clues as to occurrence, and yet, to 
quote an axiom of the industry, "oil is where you find it". 

Weeks also suggests that some scientific means must be developed to 
correlate all the factors which relate to the occurrence of oil. Most of Weeks' 
studies have not been concerned with trends in percent recovery, or on the different 
political and economic factors, which (while they have no bearing on oil incidence), 


certainly must relate to the dynamics of utilization. ” 
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With the experience of these studies as an aid, the task which remains 


is to develop a model of the ultimate utilization of oil which would tend to 


embody some of the political, economic, technological and geological factors, 


(assuming these factors can be expressed implicitly). This has been attempted 


in the next Chapters. 
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SECTION SIX - MULTIVARIATE MODELS OF DEMAND 


If one accepts the assumption that the ultimate resource of oil (R) 
is finite, there are few (if any) exogenous variables which can be linearly related 
to the domestic supply of oil. Similarly, any transformation of such variates in 
order to achieve a one-to-one relationship to y (or Y) would require some 
Herculean assumptions. For this reason, multivariate models are usually used 
for projections of energy demand rather than for estimating crude oil supply. 

So far as each firm is concerned, such studies are certainly 
necessary in order to project demand for crude oil and petroleum products. 
Usually, the time span for these projections is restricted to a period of less than 
thirty or forty years. For projections of demand for products (such as gasoline) 

a much shorter time base would likely be used. 

Probably a large number of (short term) projections of crude oil 
and oroduct demand have been made. It is instructive to describe the technique 
used in multivariate analysis, as illustrated by a recent study of the United States 
Department of the Interior. Some notion of the relationship between variables 
can be obtained, even though this writer feek that domestic crude oil production 
could be treated as an independent variable rather than a dependent variable--at 
least for a long term projection in which a substantial portion of the ultimate 
resource R would be produced. 

In the Department of the Interior study variables were divided into 
two groups, "the independent (exogenous) variables, and the dependent (endogenous) 


variables". Some variables were transformed logarithmically or exponentially. 
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The independent variables were believed to be: 
X1= Economic activity 
X 2= Population 
X 3= Industrial production 
X 4= Real costs and prices of energy resources 
X 5= Domestic supply 
X 6= Foreign trade 
X 7= Environmental restrictions 
X 8= Evolutionary technology 
~X 9 Revolutionary technology 
X10= Regional factors 
X11= Energy policy 
X12= Political considerations and trade-offs 
X13= Other variables accepted as given 
The basic model for total energy consumption was YI = f (X1,X2,X3,...,X13), 
where Y1 was the dependent variable representing total energy demand. 
The major components and subcomponents of the dependent 
variable for total energy were: 
Y 1=_ Total energy consumption, 
Y 2= Demand for bituminous coal and lignite, 


Y 3= Demand for anthracite, 


Y 4= Production of crude petroleum, 
Y 5= Demand for petroleum and natural gas liquids, 
Y 6= Production of natural gas, wet, including 


natural gas liquids, 
Y 7= Demand for natural gas, dry 
Y 8= Demand for hydropower, and 
Y 9= Demand for nuclear power, 


Whereryl=i¥oh You. Youu 7 sevortey 7. 
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The major energy markets or consuming sectors were: 


Z |= 
Z2= 
(Loe 
Z 4= 
Jig = 


Household and commercial sector demand 
Industrial sector demand, 

Transportation sector demand, 

Electric utilities sector demand, and 


Miscellaneous and unaccounted for, 


Where Yl = Z]+ Z2+ 73 + Z4 + Z5. 


The major forms of energy consumption were: 


N I= 
INEZ= 
N= 


Direct fuel uses, 
Utility electricity uses, and 


Raw material nonfuel and nonpower uses, 


where Yl = NI + N2+ N32. 


A number of composite cases were considered, including an 


analysis of demand for energy resources. In carrying out the projection to 


the year 1980 for crude oil production, assumptions were made regarding the 


percentage change anticipated in each of the independent variables for the 


period from 1965 to 1980; these assumptions were based on the results from 


the 1947 to 1965 data, ard on subjective judgements .+ 


REFERENCE : 


1. W.E. Morrison and C.L. Readling;, An Energy Model for the United 


States, Featuring Energy Balances for the Years 1947 to 1965 and Projections 


and Forecasts to the Years 1980 and 2000, Bureau of Mines Information 


Circular 8384, United States Department of the Interior, 1968, pp. 12-13, 
pp. 18-22, pp. 29-30, and p. 123. 
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CHAPTER IV 


SOME MODELS OF THE DISCOVERY AND UTILIZATION PROCESS 


SECTION ONE - PROBLEMS OF MODEL BUILDING 


The application of the scientific method to the solution of any problem 
initially involves formulation of the problem. The mere statement of a problem 
necessarily implies the kind of solution desired - otherwise the problem could 
not have been formulated in the first place. 

Problems which are never defined do not form the basis for applied 
scientific endeavors. On the other hand, the moment a problem is defined, the 
definition essentially comprises the first notion of the model which can be used 
to obtain an optimal solution to the problem. 

As any scientist (and practical person) knows, it is only common sense 
that his initial concept about a situation should be tested against some facts. 
This step will either prove the original notion to be true in both diagnosis and 
fect, or will lead to a more refined statement of the problem. 

This process can be repeated (perhaps over and over) until a clear 
theoretical understanding is odtained of both what the problem, and the solution, 
might be. The sort of intellectual activity involved, in searching (and researching) 


for an optimal solution, can be described as a three-step process, 


A. Design the Model: In thebeginning, the statement of the problem provides 
the initial framework for the design of the model, because the principles 
which apply to a solution are implicit in the formulation of the problem. 


B. Test the Model: The facts which apply here can sometimes be drawn from 


ne ee ee 


experience. However, in most scientific studies, observations and actual 
data are usually necessary to describe with any accuracy what happens, 
where it happens, and when it happens. 

C. Refine the Model: The facts gathered in the previous step can form the basis 
of how one's concept may be reformulated. Thus, an increasing awareness 
of why events occur in a certain way may lead to a redefinition of the 


model, and a return to Step A above. 
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This process may be repeated a number of times. Someone (identity 
unknown) once remarked that this iterative procedure could be called the “Las Vegas 
Method", in which improvements are made to the model until the benefits derived 
are less than the actual cost of making further improvements. 

In mathematical analysis, the computer can often carry out much of the 
optimalization® process. The final polarization of the problem into an analytically 
consistent, and succinctly descriptive method provides the basis for the next stage - 
that of generalizing the methodology for the use of others. 

This experiment of actually trying the solution, is sometimes called the 
“acid test", paraphrased by "Yes, but will it fly ?”. If the experiment is a success, 
then perhaps others can obtain similar results. Otherwise, it is a case of "back to 
the drawing board", to design a model which will predict satisfactorily. 

In the pure sciences, much theoretical framework can be built up without 
necessarily experimenting to test the results. Here, the models developed may 
provide a conceptual basis to describe a variety of events. However, in the applied 
sciences, where concrete objectives (or goals) are specified, the solution to the 
original problem can only be attained by application of methodology in the real 
world.” 

No doubt, the growing sophistication of technology plays a role in model 
development and utilization of models. Changes which occur in technology may 
bring about a need of refinement (or even realignment) of old theories. Conversely, 
new facts may find an unexpected home in theories developed many years ago. 

Economic changes, and changes in human values, also bring about 
reemphasis (and even restructuring) in the models used. When these changes 
occur, the process of model building is set in motion again. Here, revisions 
in the model may be necessitated by the overriding influence of some variable 


previously unrecognized, or perhaps discounted in importance. 


* Optimalization: Defined herein to be a systematic search of feasible solutions 
to find one which maximizes (or minimizes) an objective. 
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The steps from the definition to the solution of problems, by means of 
the scientific method have been illustrated in Figure 4.1.1. Application of common 
sense to everyday problems is very similar in nature to the use of the scientific 
method - excepting in the rigor of analysis required for model building. 

Occasionally, the task of searching for optimal solutions has been tackled 
using the black box concept. Here, given a clear definition of objective, the 
computer carries out the tedious job of finding the optimal solution. By means of 
feed-back mechanisms, the computer can even be programmed to take over the 
decision making process. While this is a necessary and valuable expedient in some 
situations, sooner or later, what goes on inside the black box becomes of ine. 

For the present study, the choice in model building was to favour traditional 
causal models. More complicated systems could be suggested in the light of any 
fundamental relationships between variables which became established. 

In a study such as the present one, with some benefit of hindsignt from 
initial results obtained, it would always be possible to repropose some of the 
models in a more elegant fashion. However, the purpose of the present study 
was merely to determine the contribution of some variables believed to be 
significant. Other studies would be necessary to more fully define the relationship 
of all the predictive variables to the production of oil. 

While there was a danger of failing to adequately depict the discovery 
and utilization of oil because an oversimplified model had been used, a first 
approximation might nevertheless be of some value. 

After some preliminary sketching of various functional relationships between 
the variables y, q, w, y and t, the task in this study became one of integrating 
these variables into a model which would serve as a comprehensive basis for 
estimating the recoverable resource of oil. Data for the United States were chosen 
for use in the models because of the amount of information available, and because 


other studies could be used in checking the order of magnitude of the results. 
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Some other areas in which previass studies had been made included:®+ ° 

1) Venezuela, and 

2) Alberta. 

It was decided that any models developed would be applied to only the 
United States in the present study. As described in Chapter Two, the oil industry 
is international in nature; more meaningful results might be obtained using the 
statistics for the following regions in the models: 

1) World, 

2 


Western Hemisphere, 


) 
) 
) North America, 
) 
) 


_. ow 


South America, and 
5) Eastern Hemisphere. 
The regional models should ideally be additive functions; ultimate resource pro- 
jections for regions should add up to the projection for all the regions. 

In this study, the models favoured were primarily deterministic in nature, 
because it was assumed that the recoverable resource of oil was 

1) some finite value, and 

2) determinate, in the sense that cumulative discoveries and cumulative 

production would (eventually) asymptotically converge on this value. 

Thus, if was assumed that a deterministic model would provide a basis for 
the discovery and utilization process. Variation of this model from the data 
would therefore be described stochastically - just as other purely random events 
which can be described stochastically, such as the fall of the dice or the spin of 
the roulette wheel. The deterministic model implicitly assumes that for a projec- 
tion of the process, all causal factors or conditions would exert a continuing 
uniform effect on the process. | 

The adequacy of a deterministic model should become apparent from an 
analysis of the randomness of variations. If the variations could not be suitably 
explained by conventional statistical techniques, then the basic analytical model 
might require further modification. 


The method of least squares was assumed to offer the most advantages in 


testing the goodness of fit of the models.” 
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The least squares principle 

1) leads to simpler statistical calculations and results, 

2) does not involve any other assumption about the distribution of 

the variations, 

3) proved to be essentially practical to use in computer algorithms 

even with non-inear relationships , and for this study 

4) in most models tended to emphasize the more recent (and probably 

more reliable data available) because of the general increase 
in the magnitude of variables with time. 

Computer algorithms which sought out the parameters in non-linear 
equations for a least squares fit were developed. Care was exercised in relating 
variables, so as to incorporate the notion of cause and effect in the model. 

The regression between variables does not establish that a causal relation 
exists. Indeed, no amount of observation alone could assure that cause and effect 
are at work. But without some kind of model of the situation, the relevance of 
variables cannot be tested. 

One hazard in model building is a tendency to rely on empirical or 
descriptive models. Descriptive models (born out of convenience) have been 
used frequently to characterize short-term dynamics. However, for long-term 
projections, it is far better to establish causal analytic models, which match any 
a priori knowledge of the situation and accommodate inferences about the nature 
of the process.» 

At this point in the study, it was useful to adopt the following notation 
for the main variables used in describing the process of discovery and utilization 


of crude oil: 


R = Ultimate recoverable resource 

y = Cumulative production at the end of the year t, 
= y (t), if the year t is used as an index. 

y = Production rate, 
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wa = Annual production during the year t, 
= y(t), if the year t is used as an index, 
= fy for convenience, since At =1 year. 
r = proved reserves at the end of year t, 
=r (t), if the year t is used as an index. 
q = Cumulative discoveries at the end of year t, 
= q(t), if the year t is used as an index, 
=ytr 
q = Discovery rate, 
3 Gal 
dt 
ras = Annual increase in cumulative discoveries, 
= Aq (t), if the year t is used as an index, 
= Aq for convenience, since At =1 year. 
U = Proved discoveries classified by year of discovery, as defined 
in Equation 3.3.3 (= u(t), with t as an index). 
Ww = Cumulative discoveries of oil in place at the end of year t, 
as defined in Equation 3.4.1, | 
= w(t), if the year t is used as an index. 
t = An arbitrary time base, in years. 
N = The year A.D. in which production was first started (or recorded). 


i, |, or k designated the year A.D. 

a, b, c, d, ... were parameters in the equations used. 

Wherever possible equations have been written with a, b, c, d, ...>0. 

In this study, a parameter was defined as an arbitrary constant or coefficient 
in a mathematical expression, which distinguished various specific cases. For 
example, ing =m y +c, mand c were the parameters, and they could be 


estimated by means of the regression of q on y. 
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A structural parameter was defined as a parameter in an equation which 
met certain boundary conditions. The structural parameters for such models could 
be estimated either by regression, or by some least squares technique, provided 
the boundary conditions were met. For example, in the equationy = my, with 
y =o for y=o , the regression of Y on y would be through the origin. It was 
assumed that the structural parameter R could be estimated from any model viable 
in long term projections. 

Some assumptions proposed for analytical model were as follows: 


1) y is a monotonically increasing function of time 


2) R is some finite value. 

SO yee: 

220 me 

5) y 20 for some finite time interval t, O S t Ss. 
Oli t=O 2 oy 20% 
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8) Ify = R,q = R, andy =" q = 0. 

9) A single maximum value exists for y. 

10) A eo acs value exists for q. 
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12) The functional relationship between the variables y, q and t could 
be expressed in a reasonably simple form. 
For the stochastic portion of the model, the following assumptions were 
made: 
1) observed values of Y(i), Q(j), and W(k), were available for 
consecutive years, 
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Q(j) = cumulative discoveries measured up to year j, and 
W(k) = cumulative discoveries of oil in place measured up 
to year k, 


2) y (t) and y (t) were the calculated values corresponding to the observed 
values Y (i) and Y (i). 

3) q(t) and q (t) were the calculated values corresponding to the observed 
values Q (j) and Q (i). 

4) w (t) and w (t) were the calculated values corresponding to the observed 
values W (k) and W (k). 

aca Yeindnd We : W (t). 
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7) The deviations (6 , 4, and D) of the observed values from the 


calculated values were normally distributed, 


where bay ALN, 
A =y -Y, and 
D=q-Q. 


Note that since q = y +r andQ = Y +r, then D = A, 

One might expect that the relative deviations of Y and Q from their 
calculated values ( y and q ) to be subject to some kind of damping if data were 
available for y and q as they approached R. For this reason, the magnitude of 
the deviations ( A and D) were assumed to provide a reasonable basis upon which 
to derive the parameters in a least squares computer algorithm. a 

The deviations in annual production (8), were similarly assumed to be 
random, and independent. Now, the rate equation for production (y) is a 
logical consequence of the equation expressing cumulative production (y) as a 
function of time. Therefore, the projection of y and y to their ultimate values 
should lead to essentially the same result for R, or a rejection of the assumption 


that a particular equation describes the rate of utilization of oil. 
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Any causal relationships which have been proposed have been expressed 
by designating the dependent and the independent variables; independent 
variables usually appear on the right hand side of an equation. In the actual 
regression equation the error term (residual) would be added to the right hand 
side of the equation. —_|t was assumed that the relative merits of each model 
could be estimated by various tests on the distribution of the residuals. These 


tests are described in the next Chapter. 
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SECTION TWO - MODELS DEPENDENT UPON TIME 
- CONTINUOUS TIME FUNCTIONS 


Long term models of the rate of utilization of oil must be bounded in 
cumulative production (or cumulative discoveries) as t becomes large. Unbounded 
models impute an estimate for R which does not exist. Some restriction must 
therefore apply to the magnitude of y (or q) in any model used to estimate R when 
time is the independent variable. 

The usual method for limiting y (or q) has been to choose an equation which 
is asymptotic to some finite value as t + co, Alternatively, any expression for y 
(or q) can be chosen, provided the general requirements described in the previous 
section are met. 

The annual statistics for Ay would provide a reasonable approximation for y 
in such equations. This approximation was useful for the derivation of comparable 
estimates of R, using different forms of the same basic model. A model in differential 
equation form should result in roughly the same estimate of R as would be obtained 
from the integrated form. 

In this Section, five analytical models which met most of the conditions outlined 
in Section One of this Chapter are described. With time as the independent variable, 
all these models were non linear, Because of problems which may be encountered in 
estimation of the parameters in the integrated equation, the models have also been 


expressed in differential equation form (and where possible, as a difference equation) 


1. Exponential Growth: The hypothesis made for the case of exponential 
growth was that the rate of production is a constant proportion of cumulative 


production so that 


Yomact/ (2s) 

The general solution of this equation is 

y=ke", (2) 
or my = &k +ct, (eae) 


where cand k are positive constants. 
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Equation 4.3.3. is linear in In y, and the constants ¢ and nk can be estimated 
by means of linear regression. In exponential growth, an upper limit for y does 
not existas t * CO, Therefore this function does not provide an estimate for R, 
and t must be restricted to a finite time horizon. 
In exponential decline in growth, y is a decreasing function of time. With 
an initial value of y given by y (0) >o for t=0, Equation 4.2.2 then becomes 
Vee Pyro yre | (ee 
withe < o. 
In this case the function is bounded , and ast *CO, y 40. 
In a model for decline in growth (such as proposed by Roberts and Arps for | 
the success of wildcat wells, and described in the previous Chapter) a hypothesis 
can be made that the rate of change is a constant proportion of the amout remaining, 


so that 


yo-=c.A= y), (t4b Oe) 
where A was the original amount which existed when t= — Integrating this 
expression with the boundary condition y 0) = o for t=0, peices 

y = Ale"), (4.2.6) 
which is of the same form as the function used by Roberts and Arps. In this case, 
Gah? oy ya tee tO Ce 01, 

Here, linear regression cannot be employed in the manner described for 
estimating the constants in Equations 4.2.2 and 4.2.3. A least squares computer 
algorithm could be used to estimate the coefficients of the equation. Also, linear 
regression of the discrete time function given by Equation 4.3.1 would provide an 


estimate of the constants of the equation. 


2. The Gompertz Curve: The Gompertz curve has an equation of the form 
y =Red! (497) 
where 0 <c < lando <d <1. Ast *?@,y >R. The value of yatt =ois Re, 
and as t + -09,y +0, In this double exponential equation the increments in ry 
(if t increases in equal increments) are proportional to % y, as shown later in this 
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The curve can be modified so as to have the lower asymptote equal to some 
positive value, say A, and . 

YEA tive e7le* Ft) (4.2.8) 
where R = A+ y would be the upper asymptote, 

@= a parameter determining the time origin, and 

8 = the time unit (or rate) parameter. 
This latter expression has been called the modified Gompertz curve. * 


By differentiating the equation for the Gompertz curve we obtain 


y=kyd =kye™, (4.2.9) 
2 CD RARE: (4.2.10) 
with k = (m c) (&d), anda=-nd. Since d <1, a>0. The above 

equations are bounded using reasonable limits for y andt. (e.g. y # 0 for t =0). 

From Equation 4,2.10, the second derivative of y is 

y =-ay [my/R) +1] (4.2.11) 

From this expression, it can be seen that the maximum value of y occurs 
when y = R/e. However, there seemed to be no a priori reason to assume (as one 
would in adopting the Gompertz equation) that annual production should reach a 
maximum value when 36. 9% of the resource had been produced. The maximum value 
in y would probably be determined by other factors (such as discovery rate, cumulative 
discoveries, and percent recovery) in addition to cumulative production. 

The graph of y (when plotted against t) has a long tail to the right. * In the 
United States (excluding Alaska), where production seems to be reaching a maximum, 
most of the oil in place remains to be recovered, and new discoveries are still being 
made.” Gains in secondary recovery might be subject to some law of diminishing 
returns, as percent recovery tended to its ultimate maximum value. Also, the rate of 
discovery of oil would likely be subject to an exponential type of decline, as implied 
by the Arps and Roberts model for wildcat wells which was described in the previous 
Chapter. 

The initial growth in y is similar to exponential growth, because of the term 
for y in the right hand side of the equation. The graph of y as a function of * has an 


inflection point when y is relatively small; it can be shown that inflection poinis 
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occur for y = 0,073R, and for y =0.682R. From inspection of actual growth of oil 
production in a number of states in the Unites States, and other political areas, this 
would describe what actually happens in the early phase of development. © 

Some transformations can be made of the Gompertz equation, so as to 
enable estimation of the parameters. 

We can obtain from Equation 4.2.7 

inly +1)/y(t) 1 =d) Gre) @-1), 

andwman ty (tel)/y (+) l= wea, CAs? 2} 

where a= -&nd, 

and a=Ml &rc)(d-1) ]. 
Equation 4,2,12 is linear inn & Ly (t+1)/y(t) J. All the constants (except R) 
can be estimated in this equation by means of linear regression. 

Similarly, from Equation 4.2.7, 

In \0n (y/R)|=v-at, Ceo Be ay 

witha=-& d 

and y=%&l cl 
Equation 4.2.13 is linear in Qn I (y/R). However, three parameters must be 
simultaneously determined in order to arrive at an estimate for R. 

In order to retain R in the equation, put 

x=ytl, 

B=R+2, | 

and assume that for t=0, y = 0, but that x =1. 

The upper limit of B =R + 2 is an arbitrary choice, but it was chosen so 
that gn |dn(x/B)| would be bounded as y + R, at as x > (B-1). 

The Gompertz equation with this boundary condition takes the form 

<= Be 8) eo (42.14) 

The linear difference equation then becomes 

tn on [x (t+) x(t) ]= -at, On [on B)(1-e")] (4.2.15) 


fom which an estimate of R can be obtained, if t =o whenx =1. 
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In the study by C. L. Moore, use was made of the fact that production 
data were available in consecutive annual increments. 

Thus, from Equation 4.2.7, 

One OMRER dance. 

and 


ip y) = rc) On ddd! 


= Qn d) (% y - mR) 
= (on d) Or y) = Gn d) (rR) 
A Gm y)/At = w -atmy (4.2.16) 
wherea=-% d, 
W=ahm R 

andA;}t = 1 year. 

Thus, the annua! increase in an y is a linear function of on y ; with regression 
analysis, the method would serve to minimize the surn of squares of deviations of the 
data from those calculated for A (m y)/ At. 

A difference equation can also be derived for the equation of the Gompertz 
curve. From Equation 4.2.7 we can write 

lnm y= R + d* In c 
Now let z(t) =" y, (Au2e7) 

b=%c, 
and m =mR. 
Therefore z(t) =m + b ay 
and ZtH) =m +b rh 
This pair of equations yields a difference equation 

rea ee eb de (dies 

which can be written as 

z.(¢+1) =m (I -d) +d z(t) (4.2.18) 

This is a first order linear difference equation with constant coefficients . 
Estimates of the coefficients can be obtained by the regression of Mm [y (f + 1) ]on 


or» Ly (t)]. Some other linear difference equations are described in the next Section. 
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Because of the logarithmic transformations of y, one would not expect the 
magnitude of deviations of Equations 4.2.12, 4.2.13, 4.2.15, 4.2.16 and 4.2.18 
to be normally distributed. A computer algorithm in which (untransformed) deviations 
frorn the calculated values of y (or y) were minimized would be more likely to have 
residual errors which were normally distributed. 

In any case, the results from each method were of interest. To the extent 
that these equations all led to the same estimate for R, there would be no practical 


reason for favoring one method over the other. 


3. The Logistic Curve: The logistic curve has an equation of the form 


q=R/(1+Ab), (4.2.19) 
whereo <b <1, andA >o. Ast +co, q>R. The value of gq at t=0 
is R/(I+A), and as t 4 -coO , q 4 0. The curve is also known as the Pearl- 
Reed curve, and has been used extensively to describe population growth. ° 
The curve can be modified so as to have the lower asymptote equal to some 
positive value, say ), and 
gan + 5/1 te ~ (at Bt) (4.2.20) 
Here, R=(A + y) would be the upper asymptote , 
a= 4 parameter determining the time origin, and 
8 = the time unit (or rate) parameter, 
This latter expression has been called the modified logistic curve. ® 


Differentiating Equation 4.2.19, one can obtain 


ae ol pee egtte - © (4.2.21) 
where k = a A/R, 
and a = | on b|. 
Also, q = ¢ q (R-q), (492-22) 


k/A, 
since R-q = au 


with c 


From Equation 4,2,22, the second derivative of q is 


Geaschq ih) = 2c). Wipes 


DeyKs i ee 200 Nala 
Bb Sh bw 81.55.20 oie el wed 
oli ive a YwacHtaul: Asidwe nt all . die 
eat eintesll “ony ae bhuow. baxtintnicn anawely, a) ~ balolvolns salt 

i hatadi Mate isthe ene delphi name te 
 instxe iit ol eave +40 stew better es nyo alse, o" . se2> ym a 
9407 stantihel ste sh ob bal Ilo mortoups eeodd fo 
vaho cut seve re ane gehpvel snot 


— 
a 
he fapiton on sd bhiew 95 wi) a 


' j : : = a , 

4 : 7 = la - eal At Ee 7 
“pret ari to nelleting ro 20, Syn Stee Sr eyes a2" a + white 
a! . 

(e1 5 > } . if ad A-~ iyi = p _ 

Sate prcsulev etl Hh p yee tak oe Alin Te se tnrlvs : 


: : _, . : 
wie? wil mnwonledegicxe cdl op aoe em Lad (nwa a 


ib ® Aleenp sotigluqon sdiizesh of (5. cagpeays Ying bi pine , evs test r > 
mond! loupe etotcunyes ewe! a! pyet ol eee hettiborn od aoa orale AT ~ * cl 
| | ih yh. vas alleve evititag, 
4 (5.904 yy (ise “) aha hie t ko 9 “ 


stotuntp saa art Sa Bluerw. b> e sy as "a 


mm netio outage ‘yo irarestel anibeneey a= 
oe rua ty Aa ii vo) it fea anit ay 
© eye Sttlge! batitom off DBltee mbes eor, holmerqes whol 2 at 
Coes” niblcs ne aro. &fotes nokta ides tae 
, { aaa 
\ oe) . ‘ romp a= oye p 7 
| Ae = ot oe 
on te o> me 
then | =: i 


DP . . ) 


a 


i 
} 


4 ipiaaa 
Pig ponese noose Bet 


a 


Gee 


One can see from this expression that the maximum value of q occurs when 
q=R/2. Furthermore, the graph of q plotted against t is symmetrical, with 
inflection points at q = (R/2) (1 + 372), There was no apparent basis for such a 
relationship to exist between q andt. While one might expect an exponential type 
of growth (when the relative effect of the (R - y) term was small) there seemed to 
be no reason to assume that decline in production was exactly imaged by growth in 
production. 

In order to determine the parameters in Equation 4.2.19, Hubbert plotted 
log [(R/q) - 1] for the data, using various values of R. The best (visual) linear 
relationship so obtained was selected as giving the best estimate for R, as 
outlined in the previous Chapter. 

A more convenient method of estimating the parameters has been described 
by Tintner. From Equation 4.2.19 we can write 

Vat) =1/R + (/R) tt 
and Watt lj = 17R + AsR)ib 
Thus, 1/q (t +1) = 1/q(t) = A/R) b' (b-1), 
and \7a + 1) =b/q(t) + {1 =b) 7R (4.2.24) 

If we let z(t) = 1/q(t), then Equation 4.2.24 can be expressed as the linear first 
order difference equation 

Zp) =n bez () (lee b)/R, (427225) 
which Tintner derived in order to improve on the statistical methods of estimation 
of the parameters in the logistic equation. The coefficients b and ((1-b)/R) can be 
estimated from ordinary least squares by the regression of z(t + 1) on z(t). 7°" 

With b and R found in this way, the third parameter A can be estimated by 
the method of Rhodes, From Equation 4.2.19 

q = R/(l + Ae) ( 4.2.26) 
with g= | b|. Rewriting Equation 4.2.26 in terms of the reciprocal of q 
as before we have 

1/q = 1/7R + A/R)e 
and R/q -1 Sah 
so that gn [(R/q) -l ]= mA- et (4.2.27) 
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If 8 and R are known, then A can be calculated. Equation 4.2.27 can 
be stated for successive values of t = 1,2,3,...N; by adding these equations 
and solving for gn A we obtain 


N 
ta = B(N+1)/2+ (I/N) San [ (R74) - 1 (4.2.28) 


+ 


12 
from which A can be calculated. 


Another means of estimating the parameters is offered by a computer 
algorithm in which a least sum of squares of the residuals was obtained using 
Equation 4.2.19 (or Equation 4.2.22 with Aq=q). The algorithm could 
simply seek out those values of the parameters which led to the best least 
squares fit with the data. This procedure offers the feature of specifying that 
the magnitude of the untransformed deviations of actual data from the calculated 


: 1314 
values for q (or q) should be minimized. / 


The logistic and the Gompertz curves have implied quite different values 
for R; neither curve has produced consistent results using data for q and for y. 
This is unfortunate, because both q and y should tend to the same estimate for R, 


if all the resource of oil is eventually produced. 


4. The Generalized Logistic Curve: 


A problem inherent in the foregoing equations of the type y = ky f (y,R) 
would be that the proportionality constant (k) and the ultimate resource (R) 
were the only parameters. No parameter determined the increase in growth 
in relation to the decline in growth of y or q. Thus, if one assumes that either 
the logistic, or the Gompertz curve provides the rule by which the resource of 
oil is utilized, then one implicitly assumes a unique relationship between the 
increase and the decline in y, and in q. 

In order to overcome this problem, another parameter can be introduced 
so as to balance the growth and decline rate. In Equation 4.2.9 we had 

yekye (4.2.9) 


for the Gompertz case. 
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Similarly, in Equation 4.2.21 we had 
2 eal 


q =kq“e (4.2.21) 
for the logistic case, 
. a t ; 
Thusteye= Keyes (4.2.29) 
é a t 
Gnd “Gq j—tequ ep, (4.2.30) 


with 1< &@ <2,are rate equations intermediate to the rate equations for the 
Gompertz and the logistic curves. Here & is a parameter which could serve to 
describe the inherent growth and decline characteristics. 

This three parameter equation becomes a four parameter equation when 
integrated, since 
enc /jmestceee nde 
then (y “78 =c + (/a) Bele CAL231) 
with o<(@-1) = 8<1 in the intermediate case. 

Bia yactaiyieoss 
with 1/Y= Be, and A =k/ce. 

“ yP=y/l +Ae AH) 
andy =R/(I_ +A enaty? (a7 23} 
with R= 9 , Where 1 <9 = 1/8, If 8 =1, we have the equation for the 
logistic curve. For the Gompertz case, if @ 7 1, 87 ©, and the equation is 
therefore intermediate to the logistic and the Gompertz. 

With some rearrangement of terms, Equation 4.2.29 can also be written 
in the form 

v= i(a/B\syee = y/RV ed (Ae s3) 
provided (8+ 1)=a@ #1. If we assume that 8 >o in Equation 4.2.33, the 
range of values for G@is@ < 8 <co. If 68 <0; the generalized logistic 
equation becomes a generalized exponential equation. *© It was assumed that 
the estimation of the parameters in such equations could be obtained by means 


of a least squares computer algorithm. 
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5. The Beta Function: A rate equation with several parameters might 
provide more flexibility in describing the growth and decline process. For example, 
employing the function for the Beta distribution, we could put 

y =ky" ®-y) (4.2.34) 
witho <m <I, ando<n <1, The integral of Equation 4.2.34 can be 
expressed in terms of a Beta function. If the resource of oil is produced in the 
span of time o < t <s, then the proportionality constant k is determined by 
the relationship *° 

eee MEE OE OC cays coe (4.2.35) 

The parameters R, k, m, and n might be estimated by means of a least 
squares computer algorithm - provided the algorithm converged with this number of 
coefficients to be estimated simultaneously. The value of m and n which provided 
the best least squares fit would be of interest, as well as the value obtained for R. 

If a three parameter equation was desired, then m (or n) could be set equal 
to some arbitrary value. The relative decline in growth could then be estimated 
independently from the value assessed for n, However, if the maximum in y is 
to occur before one half of the resource was produced, then m < n. 

If we put m + n=c, Equation 4.2.34 is a homogeneous function of degree 
c. Furthermore, if c =1, the equation is similar to an economic production 
function (except that y is the only variable on the right hand side). ~” 

Thus &n y = Ink + m My +(l-m)%m R-y), (4.2.36) 

withks= T'(1l-m) D'(m). 

Finally, if an assumption is made regarding the value for the time span(s), then 
Equation 4.2.36 has only two parameters (m and R) to be estimated. 

However, there should be enough parameters in the equation for y (or q ) 
so that the following could be independently estimated: 

i) the proportionality constant, k, 

ii) the ultimate resource, R, and 


iii) the relation between increase and decline in growth. (Alternatively, 
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the functional relationship of the variables on the right hand side could specify 
the nature of the increase and decline in growth). If other variables (such as q 
or w) were used in the right hand side in addition to y, then extra parameters might 


be necessary to link these with y. 


REGEREINGED: 


me ne a a 


Office of Oil and Gas, Department of the Interior, Washington, D.C., 
1966, Annex, p. 5. 


De Glenn James and Robert C. James, Mathematics Dictionary, Multi- 
lingual Edition, D. Van Nostrand Company, Inc., Princeton, N.J., 
EOD e LOs 


ahs J.C. Ahuja, Statistical Problems in the Study of Growth, Unpublished 
Ph. D. Thesis, Department of Mathematics, University of British 
Columbia, Vancouver, BD. G.,91 7037.1 lr 


4, Moore, p. 8. 

o- See Appendix A. 

6. DeGolyer and MacNaughton, Twentieth Century Petroleum Statistics, 
1969, Annual, DeGolyer and MacNaughton, Dallas, Texas, 1989, 
bps = 20% 

I Moore, pp. 9-6. 

8. James and James, p. 240. 

Da Ahuja, pe 9» 

10; G. Tintner, The Logistic Law of Economic Development, Journal Paper 


No. J - 3977, lowa Agricultural and Home Economics Experimental 
Station, Ames, lowa, 4 pages, Reprinted from the Journal Arthaniti, 


Volume IV, Number 1, 1960. 


hin G. Tintner, Econometrics, John Wiley & Sons, Inc., New York, 1952, 
pp. 208 - 209. 


7 : ari x ay ie: 
ac. bea Sol 1 bot igh weno ee ih io veo 
4 


pe 49 nein) gale bey! i wit: °F i “fethwong nif bi bk € 7 
- al 
gien natano wig are r sift ." cf olfitiiee ai 


3 


OOM. ‘ TAL haga ott > palin nto. agit: % eit wml ot 2 
7 7 Sek fray erie fl Ry ET inh “lg woo ip NOs 1D epi» 
ns ; - ‘ i * ~Y seesewath ,aneF 
a ; Jenene . 
‘= ALA operon) boat YA cosetie’l, . 7) Madlisd rie goneel cma AD ic . 
7 | ote 1 cents SMT. eBhh. «veade =) SRO nus \neithd lowe ah 7 


wast  X eae _ Pa 


7 binvtel Wurth! ite’ 15:5 bidity) coupdelon log yt iy i ‘e mf Pk at 


Harei is fs Lars! Aly beat ate } i Vw ‘nibs 7" \ cae it «0 Ral } a 


| " 


A : Ps ry 
- 9 i) on af 
7 ni ah eri ete th FT: vy cicamab hoa tras tate wat 
"4 tf ; - a “a ~lliy ; r\ A ks s.. 
¢ et a Ps. gSOHE Meri nu whl Yn iggha st gaunt h “st 
. ““ S ae 


0+ .99 veal 


bbs e y arora. tay Pate 4 
‘ - dl 


- 
} i 


: Us A | i , ry) <7 iin oan 
7 ae ancy L oamesioy: 2 aoa th? 


tnt F al ra Bi wES A i lak 16; uaest ea" a 
\ Tunauel one mae f Ligae Reasrials | dee “a oF 
7 i 
te over 
4 


- 
fs oT ariod Belt ‘ a 


ee Ab 


[pickeyeps 210. 
See Appendix "Computer Programs Used". 
F. R. Oliver, Aspects of the Maximum Likelihood Estimation of the 


Logistic Growth Function, American Statistical Association Journal, 
September, 1966, p. 697. 


ARO Gey ps 70. 


E. S. Keeping, Introduction to Statistical Inference, D. Van Nostrand 
Company, Inc., Princeton, New Jersey, 1962, pp. 83 - 85, and 
Ppp. 389 an 390. 


Tintner, pp. 52 - 54. 


7 oe >é 
: Aba ag ae shale 


a 
Bie. bh il pours vai aly ang of 


Al f soclen fe re, arte Tyo Rt, = , 


— a +G a - 


bauitial) oo .0 - ‘te Of CRS bie ha! eriyeet 26 243 i: 
| a } | 2a ,balebiht ..S8h..yRagrm 
“* Ge = Pak. aq 


— 


AR + $2 +Gy ante 


= 


= 7 


SECTION THREE - MODELS DEPENDENT UPON TIME 
- DISCRETE TIME FUNCTIONS 


1. First Order Difference Equation: The hypothesis made for a first order difference 
equation is that production during any year is a linear function of the production for 
the previous year, so that 

y (t) = ay (t- 1) +b, Geox) 

or y(t)-ay(t-1) = b (Ac3e2) 

This is a linear difference equation of order one and with constant 
coefficients. The solution to such equations can be found using the Laplace transforrnation, 
which leads to a staircase function to represent the discrete changes which occur in 
y (+) for each designated time period? However, the solution to Equation 4.3.2 is 
usually found by dropping the requirement that y (t) take on only discrete values for 
continuous time, and allowing a continuous expression for y (t) in whicht = 0, 1, 2,.. 

If y* is a particular solution to the complete equation 

y (t)-ay(t-1) = b, (4.3.2) 
and if yf is the general solution to the reduced equation 
Wat sonny aN aon) oan (4.3.3) 

then y (t) = y*+ yf (4.3.4) 
will be the general solution of the complete equation .* 

For Equation 4.3.3, the general solution is 

siecsucoeats (4.3.5) 


with ) an arbitrary constant. 
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For Equation 4.3.2, a particular solution is 
y* = A, (4.3.6) 
where A = b/(1 - a) is a constant. 


According to Equation 4.3.4, the general solution of Equation 4.3.2 


is given by 
y (t) = At da! (4.3.7) 
Now if y (t) = y (0) for t = 0, the general solution can be written as 
y(t) = A(l-e“)+ ye, (4.3.8) 


where c = Ina. Finally, if y (0) = 0 fort = 0, then 
yt) = A(l-e") (4.3.9) 

For c <0, y (t) would be asymptotic to A as t> 

Equation 4.3.9 is identical in form to Equation 4.2.6. Thus, the coefficients 
in these equations can be estimated by means of the regression of y (t) on y (t - 1) 
as in Equation 4.3.1, 

Finally, the coefficient ¢ in Equations 4.2.1, 4.2.2 and 4.2.3 may be 
related to the coefficient a in Equations 4.3.1 and 4.3.2. If y (4) = yak as in 
Equation 4.2.2, then in Equation 4.3.2 

ke“ (r+ ee = b (Ans 710) 
Now, ife = Ina, thenb = 0. Conversely, if the hypothesis = 0 is true, then 
we can infer that the constant c as determined in Equation 4.2.3 is equal to the 
constant In a, where a would be determined from Equation 4.3.1. Since the correlation 
between successive measurements depends upon their proximity, the probability that 


the hypothesis} = 0 is true is improved if At > 0, or if At = 1 and the range of t 
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2. Second Order Difference Equation: The hypothesis made for a second order 
difference equation is that production during any year is a function of the production 
of the two previous years, so that 
| y (t) = c-ay(t-1)-by (t - 2) (453 213) 
or yi t2)itay G+ It byt) =a (A937 12) 
where a, b and c are constants for a linear difference equation .4 
The corresponding homogenous or reduced equation is 
Valter 2) cay s(t 01) by (ie (4.3.13) 
As in the case of a first order difference equation, we attempt to find the general 
solution yf to the reduced equation and a particular solution y* to the complete equation; 
the general solution to the complete equation will then be 
EE i (4.3.14) 


For a trial general solution, let 


y(t+n) =m" (493.15) 
be substituted in Equation 4.3.12 in order to obtain the auxiliary equation 
matam+ b= 0 (4.3.16) 


The roots of this equation are 
m = (-at Va - 4b)/2 (453,17) 


The roots my and m, are not equal to zero, or Equation 4.3.13 would not be of second 
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order. Thus, m, and m, are 
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yf = kymy + koma, (4.3.18) 
ii) real and equal, and 
yi = (k, + ktm," (4.3.19) 
or iii) complex conjugates, and 
yt = kr cos (+8 + k,) (4.3.20) 
where m = r (cos 9+ i sin §) are the roots in polar form. In these three cases kK and 
ko are arbitrary constants é 
A particular solution to Equation 4.3.12 is 
y* = aconstant, (40071) 
SON NGiay at Gy Pe bya = ec 
andigsy =) c/(1 teat 1b) (463222) 
Thus the general solution to Equation 4.3.11 is 
y = y[tc/(lt+atb), (4.3.23) 


where yf is defined by Equations 4.3.18, 4.3.19 or 4.3.20. 
3. Polynomial in Time: The hypothesis made here is that production is dependent 


upon time, and can be estimated by means of the polynomial 


0) 1 2 m 
= 6 
y (t) Bot + 8yt + Bot sete a SN, (Ara24) 
where t = O, 1, 2, ... . mare the years for which observations of the dependent 
variable y (t) were available, = the unknown population regression coefficients, 
i= 0,1,2,...m,m<n-1, and 5 = the error term in the equation. 


Since the years t = 0, 1, 2, . . . n are not assumed to be independent, 
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t can be expressed in a power series as above, with each term t! being considered as 
a new variable (say tj) in the regression equation. Considered in this way, estimators 


A 
of the regression coefficients ( % ) can then be obtained by means of linear regression. ” © 


If the independent variables are introduced in the order, t, ae oat aa vee 
then the reduction in the residual sum of squares caused by successively higher order 
powers may be estimated by means of analysis of variance. Similarly, from a polynomial 
of degree m in time, the contribution by each of the m terms can be estimated, and only 
those terms accounting for most of the variation included in the final equation. The 
details of this procedure have been described in a number of texts, and computer programs 
are available for the calculations. ’ 

A polynomial of degree n - 1 would fit the n pairs of observations exactly, 
but there would be no degrees of freedom available to estimate the error mean square. 
Moreover, the inclusion of higher powers of t would not likely appreciably decrease 
the residual sum of squares. Therefore, some compromise is usually necessary in deciding 
on the order of polynomial to be used and the goodness of fit desired. In this study, 
terms up to the fourth power were included in the series. With m= 4, there would be 
some indication of the kind of relationship which might be anticipated between the 
successive coefficients Pi, if m >4 in an augmented power series. 

A shortcoming of Equation 4.3.20 is that it provides no estimate for the 
parameter R, and is of use only for a bounded range of t. If t *0a, then y(t)++c0, or 
OO, Lepending Usp whether @m is positive or negative. 

However, Equation 4.3.20 can be used to arrive at some inferences regarding 


the residuals (i.e. the differences between the observed values, y(t), and the calculated 
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values, yc(t).) If we hypothesize a simple (parabolic) time pattern exists for the 
residuals, then we can put 
S(t) = 8, + Bit + Bet, (Ayn S) 
A 
where 8; will be the estimators of the population regression coefficients, as before. 
The residuals in Equation 4.3, 25 are: 
A 
i) concave upward, if Be> 0, 
ii) concave downward, if 82 < 0, and 
iii) not concave if Bz = 0. 
A 
The standard error of 82, and the reduction of the mean square for error by introducing 


thé term 82 : in Equation 4.3.25 would indicate the degree of significance of such 


a trend, 
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SECTION FOUR - RELATIONSHIPS OF PRODUCTION TO OTHER VARIABLES 
- ANNUAL PRODUCTION 


To learn something of the relationship of annual production in the 


United States to other variables, a model which was proposed was 


Ay(t)=8 + 8: D(t) + ReGit) + BsH(t) +B .N(t) +Bsr(t), (4.4.1) 


where A y(t) = crude oil production in the United States during year t, 
Rj = the unknown population regression coefficients ,i=1,2,3,4,5, 
D(t) = consumption of all refined products in the United States 


during year t, 


G(t) = consumption of gasoline in the United States during year t, 


H(t) = crude oil production in the Western Hemisphere during year t, 

N(t) = crude oil production in North America during year t, and 

r(t) = proved reserve estimates for the United States at the end of 
ie He 


Equation 4.4.1 is a multivariate equation in which the estimators of the coefficients 
8; can be obtained by means of linear multiple regression. It was assumed that 
each variate did not require any transformation to reduce the equation to linear form = 
even for long term projections. 

Because crude oil production in the United States has constituted a 
large portion of domestic production, and of production in the Western Hemisphere, 


a modification of Equation 4.4.1 was also proposed, as follows: 


Ay(t) = 8 + B1[D(t) - dy (t) J + BelH(t) - Ay] 


+ 63 [N(t) - Ay(t) J + Bar(t) (4.4.2) 


This latter expression thus excludes production in the United States from the figures 


for production in the Western Hemisphere, and in North America. The term 
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[D(t) - A y(t) J] is the difference between domestic demand for all refined oils 
and domestic production of crude oil] in the United States. 

No time lags have been used in the above models. A time lag might 
improve the relationship between production to proved reserves; the last term 
in the equations would then be r(t - 7), where T is a time lag of say three to five 
years. Proved reserves are almost invariably produced in less than fifteen years - 


the average time depending upon the political region. “7” > 
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2. Governors’ Special Study Committee, A Study of Conservation of Oil and 
Gas In the United States, 1964, Interstate Oil Compact Commission, 
Oklahoma City, Oklahoma, 1964, p. 201. 


3. World Oil, August 15th Issue, 1967, p. 40. (Also see August 15th Issue 


for other years. ) 
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SECTION FIVE ~ RELATIONSHIPS OF PRODUCTION TO OTHER VARIABLES 
- CUMULATIVE PRODUCTION 


In addition to the annual estimates of proved reserves, data were also 
available on the estimated amount of crude oil in place in the United States 
according to discovery year. Cumulative discoveries, and cumulative discoveries 


of oil in place are linked by the equation 
p.r./100 =a/w, (3.4.6) 


according to the definition for percent recovery used by C. L. Moore. 

Therefore, in order to use data on oil in place in a model which would 
provide an estimate for cumulative discoveries (which might in turn be used to 
estimate production) an expression for the trend in percent recovery in terms of 
t (or y) would have to be proposed, Percent recovery has usually been conceived 
as a time dependent function, as indicated in Chapter III. It has been mainly new 
technology which has brought about new techniques for secondary recovery, and 
subsequent increases in primary recovery. 

In this study, a rather simple relationship was proposed to describe percent 


recovery, as follows: 


pif. emny tb (4.5.1) 


Here, percent recovery is solely dependent upon cumulative production. It was 
implicitly assumed that as cumulative production increased, new techniques became 
not only possible, but necessary because of the diminishing amount of oil to be 
discovered. The equation had an additional appeal of expressing what might happen 
in the final stages of production - when there would be relatively few new 
discoveries of oil in place, and percent recovery would increase only as existing 
deposits were proved up and produced. An upper limit for R can be estimated from 


Equation 4.5.1, because the ultimate percent recovery must be less than 100%. 
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A model which was proposed to relate cumulative production directly to 


cumulative discoveries was 
dy 
es _— Cc — A 
y ( woe?) 


This was an analytically simple function, which might serve to describe the 
build-up and final depletion of reserves, It satisfies the boundary conditions which 
were proposed in Section One of this Chapter, and expresses the concept that at the 
end of the decline process, each barrel of oil would be produced as soon as it was 


found, 


Pe bRee 
ee 
y Eye Cc 

= Rt (y tr) 


oe we a Rae (y ." r), 


and y tr= _yye 
_ 


When the Equation 4.5.2 is integrated, we obtain 


ny =cbrq +b, (4.5.3) 
om ey = bq <; (4.5.4) 
with b=R*~, since if y=q, r=0, andy =R, 
Equation 4.5.4 can also be stated as 
(y/8) = (a/R) (4.5.5) 
with ¢ >1 , since by definition q = y +r, and because proved reserve estimates 
are not negative. The hypothesis underlying Equations 4.5.2 to 4.5.5 is that 


incremental production (relative to cumulative production) is proportional to (but 


greater than) incremental discoveries (relative to cumulative discoveries). 
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Similar expressions were proposed for the relationships between cumulative 
production, cumulative discoveries of oil in place, and cumulative well footage 
drilled, as follows: 

ny =amwtk, (4.5.6) 
and ny=amftk (A 5978) 
where w (t) = cumulative discoveries of oil in place at the end of year t, 

f (t) = cumulative well footage drilled at the end of year t, 

and a,q, k, and k, were positive constants. 
In Equation 4.5.6, a< 1, since y=q~-r, r20, andq< w. However, in Equation 
4.5.7, g< 1, because y < R_ but no limitation on the magnitude of f has been 
imposed. 

In order to estimate the relative predictive value of q,w, and f in 
determining y, the following multivariate regression model was proposed : 

ny =8, +8, mqt 8,mwt 8, mf (Arno) 
where 8; = the unknown population regression coefficients. 

The equation is linear in the transformed variates, and thus the coetticienian. can 
be estimated by means of linear regression. 

Another way of expressing Equation 4.5.8 is 

ye akiqnyt (45559) 

Witha ==, woe feild =6., andink =e. i 

Similarly, if statistics on the cumulative number of wells drilled is used 
instead of cumulative well footage, then we could put 

in y = 8B, + 8, ong + 8, nw + Bain ¥ (4.5.10) 
where W(t) = the (all time ) cumulative number of wells drilled up to and 

including the year t. 
=, for convenience. 
Also, let gny = 85+ 8,9nq + 8,0n¥, | (4y5al4)) 
if data on oil in place were not available, or were not used. 
This latter model would be of use if data were available on the cumulative number 
of wells drilled, but not on the cumulative well footage. Equations 4.5.8 and 4.5.10 
provide a means for comparing the relative merits of q, w, fland ¥) to predict 


cumulative production. 
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If the hypothesis is made that the amouni of oil in place discovered per 
footage of wells drilled depends upon the amount of oil in place left to be discovered 
then 

GW (Tee ne USS 
provided w (f) = 0, when f =o , as in equations 4.2.6 and 4.3.9 and where 

f(t) = cumulative footage of wells drilled at the end of year t, 

w (f) = cumulative discoveries of oil in place for f cumulative footage 

of wells drilled, and 

W = ultimate resource of oil in place. 

As wells are drilled each year, new pools of oil are found and defined, 
leading to new reserve estimates, and finally resulting in production. The relation- 
ship between production and well footage drilled which was implied by a pair of 
equations described previously was 

ate (4.5.6) 
cna nepl elie =eyyi(iTee on) (4.5.12) 
Elimination of w results in a generalized form of the exponential equation. 

A simpler relationship can be hypothesized by assuming that cumulative 
discoveries of oil in place are reflected in cumulative production within a period 
of say three to seven years. This proposal would result in an equation of the form 

y (tea) =R (1-0 HM) (4.5.13) 
where 7=a time lag of say five years. This non linear equation could not be 
simplified further, because well footage was not a linear function of time. If 
cumulative well footage statistics were not available and data on the cumulative 
number of wells drilled were available, then Equation 4.5.13 could be modified to 
read 

ie oh = RiGee een} Casa) 

While a great number of other models could also have been proposed, 
it was decided to limit the models hypothesized to those given in this Chapter. 

The results obtained from these might provide the basis for more sophisticated 


models, but this step was beyond the scope of the present study. 
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CHAPTER V 
MATHEMATICAL ANALYSIS OF THE MODELS 


SECTION ONE - THE MODELS SELECTED 


Mathematical models may be divided into three general classes: 
1) purely deterministic, 
2) static or deterministic with simple random components, 
and 

3) stochastic. 
As outlined in the previous Chapter, the general hypothesis of this study is that 
a reasonably simple static model (or models) exist which can describe the time 
pattern of the utilization of oil in the continental United States (excluding Alaska). 

By hypothesizing any particular model M, an assumption H, (M) is 
implicitly made that a time paitern exists. The alternative hypothesis H, (M) is 
that no such time pattern exists, and that any time dependent analysis with M is 
invalid. 

One criterion selected for testing the relative merits of each model was 
the distribution of the residuals. It was assumed that if a model was adequately 
describing the time pattern, then the residuals should be a random sequence in 
time. This (and other tests of the models selected) is described in the next 
section. 

For such cn analysis of the time pattern of events to have any validity 
for long term projections, if was necessary to use measurements which: 

1) encompassed a fairly long span in time, and 
2) would be reasonably consistent estimators of the long term 
utilization of oil, as t became large. 
Some variables which were believed to satisfy these two criteria were as follows: 
Ay(t) = annual production, t=1859 to 1968. 
y(t) = cumulative production, t = 1859 to 1968. 
r(t) = proved reserves, t = 1920 to 1968. 
q(t) 


cumulative discoveries, t = 1860 to 1968. 
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v(t) = original oil in place by year of discovery, 
=11920'ra 963% 

w(t) = cumulative discoveries of oil in place, t= 
1919 to 1968. 

A‘ (i) = total wells completed per year, t= 1859 to 1968. 

¥ (+) = cumulative number of wells drilled from 1859 to the 
year t, inclusive. 

Af(t) = well footage drilled per year, t = 1934 to 1968. 

f(t) = estimated cumulative well footage drilled up to and 
including the year t, t = 1934 to 1968, 

= f, for convenience. 

D(t) = domestic demand for all refined products in the United 
States (including Alaska), 1919 to 1968. 

G(t) = domestic demand for all gasoline in the United States 
(including Alaska), 1919 to 1968. 

H(t) = crude oil production in the Western Hemisphere, 

N(t) 


I 


crude oil production in North America, 


The data for the above variables were restricted to the United States (and 
excluding Alaska whenever possible) and including any data on offshore activ- 
ities in drilling, discoveries and production. The data base for the above variables, 
along with references to the sources of information, has been summarized in the 
Appendices. Data for annual production were also included for North America, 
South America, and World, as well as for various regions in North America 
(Canada, Alberta, Alaska, Mexico, and Cuba). 

In order to use f(t) in an equation, an estimate of cumulative footage 
drilled up to 1933 had to be made. There had been 862,665 wells drilled during 
the period 1859 to 1933. Assuming the average footage for these wells to be 
2500 feet per well, cumulative footage of wells drilled up to and including 1933 
would therefore be roughly 2500 x 862,665 feet. This is merely a guess based 
on the fact that the average Jeon of wells during 1927 - 1933 was 2947 feet, 


while the average footage for 1934 - 1968 was 3813 feet per well. orb 
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Annual figures were available for only three variables (production, 
proved discoveries, and wells drilled) from 1859 onward, while observations for 
most of the other variables were available from 1920 onward. For this reason, 
some models were tested using data from a much shorter time period. The models 
selected in this study are summarized in Table 5.1.1. 

If a model was expressed as a function of time only, then the residuals 
could be calculated for t=1,2,3,....1T, where T was the number of years for 
which annual data were available. If the same function was expressed in incre- 
mental form, time could be removed from the expression because A t was always 
equal to one year. In this case it was assumed that annual data on the rate of 
change of a variable could be used to approximate the rate of change as expressed 
by the differentiated equation; that is, we equate 

Ay=s, (5.1.1) 
and Kao; (Sez 
for these models. 

This meant that slightly different hypotheses were being made for 

1) the time dependent function, and 

2) the same FVnciiOn in incremental form. 
The parameters determined by the two approaches might therefore be different 
because of the approximation used above. It was assumed that if the data 
covered a large enough span of years, (T> 40, say) then the differences 
resulting from this effect should be small, and that the results, from the two 
approaches could be compared. 

If there were gross differences in the parameters as obtained from a 
time dependent function, and from the same function in incremental form, 
then some doubt could be cast on the adequacy of these functions to 
describe the data. The differential equation is a direct consequence of the 
original function; the proposal of the time dependent function as being 

descriptive of the nature of the time path is nearly equilavent to the 


proposal of the rate equation (using annual data) to describe the time path. 


_ 7 As a ie 
ry - - 
aman ‘wlio iu % xed 
WH ari towed alider be Di OCS) ane (bel i 


itoton etal yer. south ee i 5 distiowsha 
ws 

ra f a : : pest : 
Paleo en! bobs emit het ram ge ee 
bs 7 


bees hat > he 
. . » Th cele al oa roe Stila 6b vecxt 


dies ered) 4 notlgau® bs 28 iyi conetee eine lobed oH 7 


ap 


ow 


i‘ 
oe) rd 
ahs wank. a4 i! MAN ay 


10 sey WO - Foul ef eow INT pleewt iby sy te TAt borliok 0 d blu | 

: SRO A) Dosa tow Hoo fan’ sae tid li sah iets av inh tour di idw 
ae ayowles a ' Cos ; 7. : ys | iver” Vi VOUT 2a hive seni il ‘ei ; om 
ay eth) 6h) po wind deen 1s io ake) ee int nl “cane ane Ot = 


: Ghetsieie ef @OVior: aout un : bi ek ee lt of bibue sD ¢ | taliey me 


: tine (ylteuns 4 stofirnin er 
- a F { 
: - . ‘ hy - . .. 
mt A a bok f | he = ay 
- 
- e 1) pb &»s 
ei (S.7 i i ' “ - & J sal 
: : ES 7 a stem - i wot : 
Pu 7 : Fe ; 
tt eri tal ented ° sv - “Tht € \ fy - . a Bie treht Att am itt . 
= ; : - Kok 7 _ 
: a) id Gf y Mi rte cata emt alt; (f , 
ee) a | | 7 
- ie ehh “y] mine -y ) it ih Neitahur artinz url (s ; 
f fas is'sd crcioverlt icin mega OAM ar d benighetel mefemowd ent 
- 5 


7 | a 
te Sie ati tedt bameio goyw th .eysido Bory wrt rinorqqn.ort to sag bond 


, 
= * - s | 
T zene hy en? siur.\ Yt ’ 4 ‘ ony 1a ngs ie wom OT eeucl > bev ee 
é 7 - 
je Per i ' \davy od thivark: te es 2 7 ese ert 
owl ef ich ati ies) Beil Ty ed) FATT Gee oe ‘ork Eaoerts> int ed ame =) 
Bi oH « benggrine oo bluew tokae gO 
| 


» tort banhytels a ZiMenicywd o i nei weer ony: aye nav snl " 


vim Jefeenerion) i qe one, see arl el bei cca on 1 ‘4uabon "i ye 


se al 


5° _ 


a | 


; a A o 

: ay ‘ ee enol Te Pa \2 Srupabis srt oh por 
- ay Ae: on Gomes ypaziien Prete ib 2] fit otaups” late 

¢ gale xe noitso’ A oa erik ott lwo 


ve 
ahr 


* 


vip ybpan # i Hatt aad aft) 
7 


edt s rata ofetoh. wavs it Lp rr Th fe us Mies ih ped Ss 


3 


Pi 7 
i 


OV. 


TABLEN Sat. 


Thiwi@DeLSeSbLEGTE De ORDESCRIBE® THE 
LONG TERM UTILIZATION OF OIL 


Model Eouation Assumed Relationship Variables Parameters 


Exponential Growth: 
M1 A223 ny =tnk+ ct y;t dink, ¢ 
M2 aoe y(t)=ay(t-1)+b y(t) a,b 


Second Order Difference Equation: 


M3 Aber. \ y(t) =¢ - ay(t-1) - by(t-2) y(t) a,b,c 


Polynomial in Time: 
M4 4.3.24 Vie ated ste. Heat: Te. t pak st Bj 


Multivariate Models: 


M5 ALAA Ay tae, 48, Di) te. Gu) +e. H@) 
+8, N(t)+8, r(t) Ay:D,G,H, 
N,r A 8; 
M6 4.4.2 Ay(t)=8,+8, [ D(t)-aAy(t) ]+8, [H()- Ay() J 


+8, [N(t)- Ay(t) ]+8, r(t) 
AY: Bins: Bj 


M7 Anand tiny =854+8 ong + 85 on wt8 3 onf 
yi Gwe f* oi 
M8 AESe I on y=8,+8, onqt8, on ¥ y34,¥ B. 
The Gompertz Curve: 
M9 4,2.7 rea yit Kose ahs 
M10 Ai2.10 A y = -ayonly/R) A yiy qpke 
Dale 
M11 Ael.17 only (t+ 1) = (1-d)(anR)+donLy (1) - 


PANS) y (t) (1-d) (on R),d 
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Model — Equation Assumed Relationship Variables Parameters 
The Logistic Curve: 
M12 A.2.19%** y =R/(1+ Ab) yit R,A, b** 
M13 Hae eee Noy ey Rim) Ayiy oh a 
Daler) 
M 14 A.2.248** W/y(tt1)=b/y(t)+(1-b)/R y(t) b,(1-b)/R 
The Generalized Logistic Curve: 
M15 eR ieee yit R,A,a,0** 
M16 A UiEN coo hk =ireliloyey 4 Ryey k,R,o** 
4.5.1 
The Beta Function: 
M17 4.2.34  Ay=ky' (R-y)" Ayiy k,m,n,R** 
M18 4.2.36 9 Ay=ky™ (R-y)!-m 
Ss ball NY AY ineako. 
Miscellaneous Relationships: 
M19 4.5.1***  y=p.r./m - b/m yiq,w m,b; 
R<(1-b) /m 
M20 eee. ny =¢ingtmnb yig c, Inb; 
on b=(1-c) on R 
M21 4.5.4 y=bq-~ yiq b,c;** 
tn b=(1-c) anR 
M22 tee ey. Siee yi¥ Roe 
i Data not available for the period 1859 to 1968. 
ee A computer algorithm to minimize the sum of squares of 
the residuals would be necessary to calculate the para~ 
meters; the parameters in all the other equations could be 
estimated by means of linear regression. 
kk 


Production rather than discoveries have been used as the 


dependent variable in the expression to allow comparisons 


between models; q (or Aq) could be exchanged for y(or Ay) 


in most models if warranted. 
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Similarly, a function could be hypothesized in difference equation form, 
resulting in slightly different estimates of the parameters than would be obtained 
if the function were expressed in differential equation form. Again, if a long enough 
span of time T is used, then this kind of error should be small. 

In some models, transformations of variates have been made in order to 
reduce the equation to linear form. These transformations included the use of 
logarithms and reciprocals of the dependent variable. Data used in such models 
were transformed prior to carrying out the regression between variates. In this case, 
the relative magnitude of the residuals was affected by the transformation. The 
residuals being minimized by least squares were in a non-linear form. 

While study of the analytical portion of the model was simplified by the 
transformation, the analysis of the stochastic portion of the model became more 
difficult under the assumptions that have been made in the previous Chapter 
regarding the nature of the residuals. This might present a problem where there 
was a wide range of values for the variates over the period being studied, such as 
from nearly zero to some large number. In any case, if a transformation led to a 
much different result than obtained when the data were not transformed, then one of 
the approaches should be rejected. The selection of the most appropriate model 
would be dependent upon the tests made on all models, including the time pattern 
of the residuals. 

Each statistical analysis was thus to be built on a model proposed to link 
opservable reality with the basic mechanism generating the observations. The model 
should be a parsimonious description of nature; its functional form should be simple, 
and the number of parameters and components should be a minimum. The model 
should be structured in such a way that each parameter can be interpreted easily 
and identified with some aspect of reality. The functional form should be sufficiently 
tractable to permit the sort of mathematical manipulations required for the estimation 


of its parameters and other inferences about ifs nature. 
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SECTION TWVO: = SeSts "OP THEIMODELS 


With a knowledge of the general characteristics of various models, and 
of the results that had been obtained using some of the models, tests which would 
assess the relative merits of each model had to be proposed. In proposing criteria, 
or tests, of the models, a distinction was made between what this writer believed 
were: 

1. the necessary properties of the model, and of its parameters, and 
2. ithe desirable properties of the model, and of its parameters. 


In other words, more weight, or credence, was assigned to some of the tests, or 


criteria. 
Let z(t) = z = the actual (untransformed) observations of the variate z, 
for which a time pattern was assumed to exist t= 1,2,3,.... 
z'(t) = z' = the values calculated for the variate z using a particular 

model ata W2 ore sk 

Also, put e(t) = z'(t) - z(t) (P18) 
= the residuals, or difference between the calculated values 
and the observed values, t= 1,2,3,.+-N, 

where N= the number of years for which data was available. 


Then some of the necessary properties of the models were assumed to be: 

A. The expression for z = y (or q) would be a monotonically 
increasing function of time. 

B. The functional relationship of the model for z = y ( or q, or 
Ay, or Aq) could be expressed in a form suitable for 
reasonably simple calculations of the time pattern. 

C. The variance of the residuals would be minimized. 

D. The model should be unbiased. (ie. ~ The residuals e(t) 


should have an expectation equal to zero.) 
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E. A single maximum exists in those models with z = A y (or Aq). 

besos y= c= k. 

Ge it'y = Ry then g = Rp andsy = q= 0. 

H. A model using z =q should result in essentially the same long 
term estimate for R as this model would give using z = y, 
because y lags q by about 10 to 15 years. 

|. The estimate of the parameter R should probably be greater than 193 
billion barrels for the United States; this figure was 50% of 
the original oil in place discovered up to 1968.7* 

J. The estimate of the parameter R should be less than 2319 
billion barrels for the United States, which is six times the 
amount of oil in place found up to 1968. (This figure is 
greater than any estimate of the ultimate resource of ofl in. 
place, and actual cumulative production would be less.) 

K. If a model for y(or q) could also be expressed in differential 
equation form, then this latter model with ee = (or Aq =) 
as in Equation 5.1.1, should result in an estimate for R 
consistent with the estimate for R obtained using y (or q) in 


the integrated form. 


Some of the features believed to be desirable for each model were: 

L. The residuals e(t) (using untransformed variates) should be 
normally distributed. 

M. The model should exhibit the feature of additivity if used for 
several regions, and then for the total of the regions. 

N. The parameters should have some meaningful interpretation 
with some aspect of reality. 

O. The model should be invariant with change in scale of the 
variables. For example, the estimate of the parameter R 
should be the same whether the variables are measured in 


thousands of barrels, or in millions of barrels. 
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P. The model should be in dimensionally balanced form. 
(This implies that homogeneous functions were preferable 
to non-homogeneous functions, if each variable in the 
function were measured in the same units, such as barrels.) 
Q. Annual data for the variables should be available for a 
period of at least 40 years, because less than one half of the 


resource has been produced up to 1968. 


The analytic features of each model were fully discussed in the 
preceding Chapter. The test statistics for the stochastic portion of the model are 
given in Equations 5.2.2 to 5 a6. 

One obvious measure of the adequacy of each model is given by the 
coefficient of correlation, 

b= eCeme arene). 2 (5.2.2) 
= the correlation coefficient of the observed values (z) 
with the calculated values (z'), 
where var (z')= the variance of z', 

var (z) = the variance of z, and 

cov (z',z) = the covariance of z' and z. 

The relative significance of p can then be assessed by the formula 

Prob ( py ue ea) (5253) 
where Ola the lower limit of p , 
Oy = the upper limit of p, and 
a =the level of significance expressed in decimal form. (If each 


variable increased with time, the fact that 9 > 0 may not be significant.) 


In multivariate linear models, the relative significance of the multiple 
correlation coefficient could be assessed. Here, one degree of freedom would 
be lost for each independent variable added. The relative significance of each 
independent variable in prediction could be measured by the estimated value 


: ae 4 
and standard error of the regression coefficients. 


= © la te 


a ea 
roe 
tte: beorate wail 
stiss nateng ie vou tna < ia ies 
e . etthd ubiloten’: reas ot het ae = a ail 


alae 


at 


7 : m - 
7 : : Re dtoncd én -dloux Wail’ sito ait ni he OMe ow Te 
- 

- y 


d btyowk ealdetioy ame tot p Inun at oil 
. 
‘\ -* grb to Merl ane nes’? eel otuneod ,aey Wh Aik id ating 


a 
é, P _ ; 
"hat Pot. gu’ bases oda ain ‘a3 
cs 


o 1 a! they Li aps OVE a 


f . - 
Vee ere doon te o dave afiyliono oT 


7 : ell ti weeny he: hee 

7 sie Labor GA Ie noilieg oitendocts adi vl 21 tebe te0) oat — 

-_ - ‘ Ph ; | Pe 
it .46 a? at anoloup 7 ig 


: . | » | i a 
/ 7 } qi# vel rig it 2 | J rth ft oi cn ee le! ae ty ongtagin Lauervde We. 


oe 7% a . . | os ence 908 


(¢.2.er | (x) touts) tev oct fh wte® a 7 ‘i 
tt satay tah aoda silt Pah tebe tt ey re avitehe . dba a “ih tf 
| ylav. det eres ; nti tine 
= dao Ac hio jnas) wy 
tir 2 t6 4 hahtey ott = 13408 


yee if 


ay en = 
eer On 


{ a > 4 1 ay! het ti « f 
wo huweriort erie vel padwost Oa Ha 1S. cy Tagore i Toga} $ ton afl 


eer te nse TMs | 
: via q ne ae howe! aah = pe 


: 


) <l tres \t tp Haat seg wat = eo | 
7 


ss YW) ymie? lomiard oi bearer Icy = Hiei, tp it aval: sol = 
ty (4rmstilngl: s | iar \h ifr Uh « ‘ng Tom toe eit casings ae et err ati 


ae i, 
* Hey there perth 


& we 


atu cy ibess? a Sergob Bio ,aTat é ee a ae a i 3) 
wt & sik i i i daa ed) .bebby olde a 4 


sittined blue ve 1 
Spitony en oF —_ ; : 


- mn 
wo Ce ele 
at i eee 


OAL 


The magnitude of the deviations is given by: 


N 
S== e(t), e254) 
ice 
N 
and SA =» | e(t) | 
tot (one ad) 


Where e(t)=z'(t) - z(t), or e=z'-z, for convenience. 
= the residuals, or the differences between the calculated values 
and the observed values. 


A more appropriate measure of the residuals is given by the sum of squares of the 


residuals, SSE, 
N : 
where SSE=y [e(t) ]? (5,220) 


Goma 
In equation 5.2.4 the expected value of S is zero if SSE is minimized. How- 
ever, there is no guarantee that S=0 if ordinary least squares is carried out 
using transformed variates. 

An unbiased estimator of the variance of the residuals, var(e), iss, 
with s* =SSE/(n-p), (5e2e7,) 
where s =the standard error of estimate, 

n = N-l, and p= the number of independent variables. 
The serial correlation of the population of residuals is given by: 
ps =cov [e(t +1), e(t) ] [var e(t +1) var e(t) 72 (552-6) 

The sample statistic tz (which is an estimate for ps) can be calculated 
from the correlation between successive residual terms. If the residuals are 
divided into two series 

5, = Yel), eb), ele). o(N-2)} (5.2.9) 
and 8, =f e(2), e(), e(4),++-e(N)f 


then fr, =the simple correlation coefficient between S$, and S, . 


: #8.-8,6) 
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A value of r, significantly different than zero would indicate that 
a time pattern existed for the residuals, and that the model was not adequately 
describing the time pattern of the rate of utilization. Unfortunately, the test 
for significance could not be used in the same way as for the usual correlation 
coefficient since it is necessay to make allowances for the regression relation- 
ship from which the residuals were derived, as in the case of lagged equations or 
difference equations. 
In a multivariate linear model, if 
Zig eo fey Mle ea lg ieee? bp Ip © cI 
=z! +e | (5.2.10) 
where Qi = the estimates of the population regression coefficients, {=1,2,3, ++ +P 
p = the number of independent variables, 
N = the number of observations, and i=1,2,3,++-N, 


then the estimate of multiple correlation R is given by 


R2 = SSR/SST (eee ih) 
Where SSR = the sum of products in regression 
Ligx ER emer oN eae iN 
mesh br oars N ae. G tio pat ie (o2012)) 
j-i 1=1 i=. i=1 


and SST = the total sum of squares for the dependent variable 
Beng Gots 2 
St Ny) (So2013) 
In this case we write 


se% = the sum of squares of the residuals 


=3(z; - z})? 
= SST - SSR = SSE (552,14) 
Also, the quantity F = MSR/MSE (52 lb) 


has an F distribution with p and n-p degrees of freedom 
where MSR = the mean square for regression 

= $SR/p, (oe2i-16) 
and MSE = the mean square for error 


= SSE/(n-p) oa 
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The results of the above computations are usually displayed in an analysis of 


variance table, such as below: 


ee 


Source of Degrees of Sums of Mean 

Variation Freedom Squares Square F 
Regression p SSR MSR MSR/MSE 
Error n-p SSE MSE 


Total n Soil! 


If the residuals are independent, and normally distributed, it may be shown 
that Bj is normally distributed with mean 8B; and an unbiased estimate of variance 

§§ = 5? Cii (5.2.18) 
where Cii is the diagonal element of the variance covariance matrix C for the 
model in Equation 5.2.10. 
Also t; = (@i - gi)/s% (552519) 
has a Student's t-distribution with n-p degrees of freedom. 

Equations 5.2.18 and 5.2.19 can give an indication of whether 

the regression coefficients are significant under the hypothesis that 

Gi =s0 el adye;sp <> -p 
A table such as the following would indicate the information required for 


regression coefficients in Equation 5.2.10 


Type of 

Estimate io bao Iya ot cet eters 
~ [> 5 

By Bo By a 

sj $y Si 

ui Py bp 


It is also possible to augment the analysis of variance table in such 
a way as to sort out only those variables Xip accounting for a significant level of 
the variation in the dependent variable. However, a stepwise regression algor- 


ithm provided substantially the required results for the purpose of this study. 
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We let Xa = the independent variable which accounts for the largest proportion 
of variation, 
Xb = the variable which accounts for the largest proportion of the remaining 
variation, 
and so on for the remaining p-2 independent variables, 
also, let r, = the partial correlation of z and Xa 
tb,q= the partial correlation of z and Xb with the effect of Xa removed 
roscl ~ the partial correlation of z and Xe with the effect of Xa and Xb 
removed. 
Then V, =rg? = the proportion of variation accounted for by Xa 
Vb (ta?) 16g 
= the proportion of variation accounted for by X, 
Vo = (-r§) I-18 a) FE ab 
= the proportion of variation accounted for by X, 
and Vj = the proportion of variation in the dependent variable accounted for by 
the independent variable Xi, when the Kis are entered in the stepwise 
order. Vi would ordinarily be expressed in percent, as indicated in the 
following layout. 
PROPORTION OF VARIATION IN THE DEPENDENT VARIABLE ACCOUNTED 
FOR BY EACH INDEPENDENT VARIABLE : 


Independent Estimate of the Proportion 


Variable Accounted for (Vj) 


Xx j=a Vaq (in percent) 
Xb i—b Vp (in percent) 
Xx iliac Ve (in percent) 


The above calculations could be conveniently handled by means of International 


Business Machines A Programming Language ( APL/360) 2.5 +” 
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f a difference equation is not used, the Durbin - Watson statistic provides a 


convenient measure of the serial correlation of the residuals, with: 


dw = Celt) ~ e(t-2 J” /SSE, (S222 1) 


= the Durbin - Watson statistic. 
Tables of the upper and lower significance points for dw are given in the 
1951 article by Durbin and Watson. A low value for dw would indicate 
positive serial correlation. The test does not apply for lagged equations, or 
for the case of regression through the origin. ga 
A nonparametric measure of the nature of the residuals is given by: 


P(mS uy) = @ = the probability that the (S22522) 


observed number of runs is less than ier 


where m = the observed number of runs, 
p = the observed number of positive signs, 
n = the observed number negative signs, 
mM = the mean number of runs, 
= lot 2one/A(p en) 
o = the standard deviation of the number of runs, 
and Gg? Sian od iWon en Gol 


Usually q@= 0.025 for a 95% confidence level. ees 


Besides the above tests, it was of interest to determine the year of the 
maximum residual size. If the maximum deviation occurred at the very beginning 
(or end) of the time series, one could suspect (but not prove) that the model 
tended to deviate from the true time pattern of the observation. 

A much more powerful test of such a tendency in the model is given in 
Chapter IV by: 

elt) tht ibghet pst (4.3.25) 
If B, >o, then the residuals would be concave upward. The reduction of the 
mean square for error by introducing the term B, f would also indicate the 


significance of a simple (parabolic) time trend. 
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An indication of the normality of the residuals is given by the coeffic- 
ients of skewness and kurtosis of the residuals, which are defined by: 

Saal gee aaa (5.2.23) 

and Ku =k, /k, (5.2.24) 
where k, = Nm, + (N-1) 
ks = N° mg = (N-1)(N-2) 
k, = N® ((N#1)m, - 3(N-1)m2) 
(N-1) (N=2)(N-3) 

m, =S,-S; 

ms = Sq - 35,5, +28. 

yaaa See or ee SKS 


md Sp==- BD (elt) )', 1=1,2,3/4. 
f= 2 


By expanding, we arrive at 


3/2 
Seine (Seeeons a.) N-l Ve 
) 


ee 


(N-1) (N-2) N(S5-S 
; (5.2.25) 
end Kors UNI= 1) GINEEIMS, 4S. 9. 4169" Sa sone) 
RET See ie eae SO hloslas 32558 Je 
(5.2.26) 


If the residuals are normally distributed, the coefficients of skewness 


and kurtosis have standard deviations of 


pee 6N(N-1) 2 
(N- ea (52327) 


== (6/n) for skewness 


ale 
StU re ye ercsea N= —_ (5.2.28) 
(N=3)(N=2) (N43) (N45) 


= (24/n) = for kurtosis 


This is essentially the form in which calculations were carried out, 


using the APL/360 system. 
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SECTION THREE - ANALYSIS OF RESULTS 


The models M1 to M22 shown in Table 5.1.1 all have production (or 
some transformation of production) as the dependent variable. This allowed 
comparison of the results obtained from the various models. 

Data for three of the models (M5, M6, and M19) were available only 
for 1920 to 1968. In one model (M7) data were available for only 1934 to 1968. 
In all other models, data on production and on wells drilled were available for 
the years 1859 through to 1968; in three models (M8, M20 and M21), estimates 
of cumulative discoveries were available for the years 1860 to 1968. The series 
of estimates for cumulative discoveries for the year ending 1969 were not avail- 
able in time to be incorporated in this study. 

It was decided to study the test results from the multivariate models 
first, so as to get some idea of the relative contribution of various variables as _ 
estimators of production. These statistics have been summarized in Tables 5.3.1, 
9 deLy-JudeD, 060.4, 0.3.0, and 59.3.6.. All figures in these (and subsequent) 
tables have been rounded to four figure accuracy. Note that a number followed 
by E-k indicates that the number is to be multiplied by 10°*: 

A sition of statistics were calculated concerning the residuals in each 
model. The residuals in each of the multivariate models were not random, so far 
as the expected number of runs were concerned. This may have been caused by 
periodic highs or lows in production, which could have some basis apart from any 
long term trend. However, the residuals in models M3 and M8 were significanily 
skewed and peaked, as measured by the tests of skewness and kurtosis, and so 
were not normal in distribution. 

The fact that residuals in all the twenty-two models did not have a random 
number of runs tended to confirm the idea that some (independent) cycle of highs 
and lows for production indeed existed. The t-values for this test are shown in 
Tables 5.3.8, 5.3.20, 5.3.23 and 5.3.26. In model M3, the level of significance 


was only valid at the 2% level. However, when model M3 was expressed with y as 


continued on p.128 
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TABLE 5.3.1 
RESULTS FROM MULTIVARIATE MODEL M7 FOR CUMULATIVE 
PRODUCTION USING DATA FROM 1934 TO 1968 


| FUNCTIONAL FORM OF M7: 
ny = By + B10n A+ Bean Wt By on f 


Il ANALYSIS OF VARIANCE: 


Source of Degrees of Sums of Mean 

Variation Freedom Squares Square F 
Regression S 8.586 Leal oy 5479 
Error 3: 1.619E-2 5.223 E-4 


Total 34 8.602 © 


Il] MEASURES OF THE SAMPLE REGRESSION COEFFICIENTS 


Type of 
i &stimate 1 =o" i=1' i =o i=3 
~ ~3.038 0.9564  -0.2035 0.3702 
Sj 0.2301 072029 0.1245 
tj 4.156 -0.7743 2a 974 
a =(0,05% 270 <U,076 


IV PROPORTION OF VARIATION IN THE DEPENDENT VARIABLE ACCOUNTED 
FOR BY EACH INDEPENDENT VARIABLE : 


Independendent Estimate of the Proportion 
Variable Accounted for (Vj) 
ing 99.4924% 
on w 0.0036% 


V MULTIPLE CORRELATION COEFFICIENT : 
Ra 0.9701 


VI. STANDARD ERROR OF THE RESIDUALS: 
4s SeeRin elena 
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TABLE 5.3.2 


RESULTS FROM MULTIVARIATE MODEL M5 FOR ANNUAL 
PRODUCTION USING DATA FROM 1920 TO 1968 


FUNCTIONAL FORM OF M5: 
Ay(t)=Botp1 D(t)+g2 G(t)+8s H(t)+g4N(t)+ger(t) 


ANALYSIS OF VARIANCE: 


ee rt rt ee ee ee 


Source of Degrees of Sums of Mean 


Variation Freedom Squares Square F 
Regression ©) DARA A ed 95622 FO ol lo 
Error 43 8.03554 1.,869E3 


Total 48 Te led 


ooo 


I 


MEASURES OF THE SAMPLE REGRESSION COEFFICIENTS: 


Type of 

Estimate i =o i =3 i=e2 i=3 i=4 i=s5 
BD; -1].544E2 -3.419E-1 3.832E-1 -2.920E-1 1.318 PAS IE-2 
Si m1. 2O0ZE-1 .2.359E=) 4 1,220E=-le We282E=1 4,205E-3 
i -2./08 1.624 -2.393 WeO2ZBE le solo? 

Oi; a0.o70 Sew a4 7G 0, 057om 0,057 


IV PROPORTION OF VARIATION IN THE DEPENDENT VARIABLE ACCOUNTED 
FOR BY EACH INDEPENDENT VARIABLE : 


Independent Estimate of the Proportion 


Variable Accounted for (Vj) 
N (ft) 98.2256 % 
r(t) Lio o se 
D(t) 02307724 
H(t) 030227 3% 
G(t) 0.0169 % 


V MULTIPLE CORRELATION COEFFICIENT: 
Reese Os 97/2 


VI STANDARD ERROR OF THE RESIDUALS: 
s = 43.23 
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TABLE 5.3.3 
RESULTS FROM MULTIVARIATE MODEL M6 FOR ANNUAL 
PRODUCTION USING DATA FROM 1920 TO 1968 
| FUNCTIONAL FORM OF Mé 
Ay(t) =85 +8, [D(t) - Ay(t) +8, [H() - Ay(H)] 
+ 8 0N(t)- Ay(t)}* 8, r(t) 


Il ANALYSIS OF VARIANCE: 


Source of Degrees of Sums of Mean 
Variation Freedom Squares Square F 
Regression 4 DeGOSE7 Hr LEO 625.0 
Error 44 5.044E5 1.14654 

Total 48 27, 


MEASURES OF IGE SAMPLE REGRESSION COEFFICIENTS: 


Type of 

Estimate i=o La le i=s L=4 

Bj -5.801E] -8.406E-1 8.673E-1 3.432E-1 6.185E-2 
cf 2.300E-1 2.525E-1 5.050E-1 7.262E-3 
i -3.655 3.4385 6n795E-1 S06 


Oj 2005 eee UD eee a <0.05% 


ee en ES Se ee 


IV PROPORTION OF VARIATION IN THE DEPENDENT VARIABLE ACCOUNTED 
FOR BY EACH INDEPENDENT VARIABLE : 


Independent Estimate of the Proportion 


Variable Accounted (Vj) 
r(t) 95.6777 % 
H(t) - Ay(t) 2.0698 % 
D(t) - Ay(t) 0.5164% 
N(t) - Ay(t) 0.0018 % 


VeMUETIPEE CORRELATION “COEFFICIENT: 
Ro) Os 7027, 


Vi STANDARD sERRORMOF THE RESIDUALS: 
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TABLE 5.3.4 


RESULTS FROM MULTIVARIATE MODEL M4 
FOR CUMULATIVE PRODUCTION __ 


FUNCTIONAL FORM OF M4: 
Wis iGo Batt Baten ost aa a 


ANALYSIS OF VARIANCE: 


Source of Degrees of Sums of Mean 

Variation Freedom Squares Square F 
Regression 4 6.183E10 L.s46E108 72660 
Error 105 2eDo4ae7 DeiZ7ES 


Total 109 6.186E10 


MEASURES OF THE SAMPLE REGRESSION COEFFICIENTS: 


Type of | 

Estimate 1=o i=1 i=e i=s3 i= 4 
B ~956.9 19873 ee a4 9.075E-2 SEQ5EA 
.: 28.74 1.046 1.413E+2 GesGE-S 
t 6000 ee7E eT 6122 4,965 


a. =< 0.0570 2 020) oa 0576 < 0.05% 


a eee 


PROPORTION OF VARIATION IN THE DEPENDENT VARIABLE ACCOUNTED 
BY EACH INDEPENDENT VARIABLE: 


Independent Estimate of the Proportion 


Variable Accounted for (Vi) 


(An estimate of the proportion of variation accounted for by each variable 
could not readily be calculated on a 110 x 5 data matrjx using standard 
computer algorithms on the APL/360 system. These estimates are therefore 
not given for model M4.) 


MULTIPLE CORRELATION COEFFICIENT: 
Re 059776 
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STANDARD ERROR OF THE RESIDUALS: 


s = 461.2 
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TABLE O.a%o 


RESULTS FROM MULTIVARIATE MODEL M8 
FOR CUMULATIVE PRODUCTION 


FUNCTIONAL FORM OF N28: 
iny(t) = Bo + Bang(t) + Bo mn Wit) 


ANALYSIS OF VARIANCE: 


Source of Degrees of Sums of Mean 
Variation Freedom Squares Square F 
Regression 2 849.7 424.8 13470 
Error 106 3.342 Oe ioce=2 


Total 108 853 .0 


MEASURES OF THE SAMPLE REGRESSION COEFFICIENTS: 


ee 


Type of 

Estimate i =o i=1 i=e2 
B. e7. Olu O16 0.6378 
so oe Ol set Bal IsE2 
t | 93 26 20.49 


O. <0.05% <0.05% 


ee ne ne ne 


PROPORTION OF VARIATION IN THE DEPENDENT VARIABLE ACCOUNTED 
FOR BY EACH IINDEPENDENT VARIABLE: 


a a 


Independent Estimate of the Proportion 
Variable Accounted for (Vi) 
on q(t) 98.0562% 


On W(t) 1.5520% 
MULTIPLE CORRELATION COEFFICIENT: 
R2 = 0.9961 


STANDARD ERROR OF THE RESIDUALS: 
s =0.1776 
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TABLE 5.3.6 
RESULTS FROM MODEL M3 FOR CUMULATIVE PRODUCTION 


1 FUNCTIONAL FORM OF M3: 
y(t) = c-ay(t-1) - by(t-2) 


Il ANALYSIS OF VARIANCE: 


Source of Degrees of Sums of Mean 

Variation Freedom Squares Square i 
Regression 2 6.1 G0ET0 3.065E10 7.899E6 
Error 105 4.074E5 3.880E3 

Total 107 6.130E10 


Il] MEASURES OF THE SAMPLE REGRESSION COEFFICIENTS: 


Le, 


Type of 

Estimate ic eal ie 

8; 1.14761 2.020 -1.019 

sj 2.803E~2 2.917E-2 
ti 7.20661 -3.495E1 
0 | <0.05% <0.05% 


Vi PROPORTION OF VARIATION IN THE DEPENDENT VARIABLE ACCOUNTED 
FOR BY EACH INDEPENDENT VARIABLE: 


Independent Estimate of the Proportion 
Variable Accounted for (Vj) 
y(t-1) 99.9916 % 

y(t-2) 0.0077 % 


V MULTIPLE CORRELATION COEFFICIENT: 
Ro= 0.9777 


VIOSTANDARD ERROR OF THE RESIDUALS: 
s = 62.29 
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TABLE 5.3.9 


RESULTS FROM MODEL M3t 


FUNCTIONAL FORM OF M3!: 


= t t 
Yeruho 2 (ba ee ail 


(m, = 1.024; mz = 0.9953, rounded to four figures.) 


ANALYSIS Ge VARIANCE: 


Source of Degress of Sums of Mean 

Variation Freedom Squares Square _ F 
Regression 2 6.193610 3207/B10" 10220 
Error 107 36220E3 3 009E6 

Total 109 6.186E10 


en re eS rn a rn en RD 


MEASURES OF THE SAMPLE REGRESSION COEFFICIENTS: 


Type of 

Estimate i=0 i=] i=2 
EN 
B. -1.386E5 1.040E4 Ieo27 ES 
s, Wo25 3825 
te 84.20 427 | 
ores =<O0n1% ZAMIR TE 


MULTIPLE CORRELATION COEFFICIENT: 
R? = 0.9948 


STANDARD ERROR OF THE RESIDUALS: 
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TABLED eo 612 
CORRELATION MATRIX FOR MODEL M3 


Variable 
y(t) 0.9999 0.9998 
y(t-1) 1 0.9999 
y(t-2) l 

TWH? S Gaile 
CORRELATION MATRIX FOR MODEL M4 

Variable 
y 0.9984 0.9842 0.9421 
t 1 0.9688 0.9175 
P 1 0.9861 
+® l 
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TABLE 5.3.14 
CORRELATION MATRIX FOR MODEL M5 


Variable 
Ay(t) ] 0.9799 0.9825 0.9818 
D (t) 1 0.9968 0.9988 
G (t) / 0.9958 
H (ft) 
N (t) 
r (i) 
WABLECO+Oe 
CORRELATION MATRIX FOR MODEL M6. 
Variable 
A y(t) 1 0.8791 0.9220 0.7792 
D(t)-Ay(t) ! 0.9895 0.9620 
H(t)-Ay(t) ] 0.9500 


N(t)-Ay(t) 
r(t)-A y(t) 


0.9781 

0.879] 

0.8701 

0.6930 
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TABLED 23.16 
CORRELATION MATRIX FOR MODEL M7 


Variable 
y(t) l OS 0.9800 
q(t) 1 0.9898 
w(t) 1 
f(t) 


TABLET oe 


CORRELATION MATRIX FOR MODEL M8 


Variable 
y(t) 1 0.9902 0.9880 
q(t) l 0.9645 


Y(t) 1 


OF 7732 
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a function of time (as outlined below) the residuals were not random, and t = 9.961. 
(The results for model Mat are shown in Tables 5.3.9, 5.3.10 and 5.3.11.) 

The two roots m, and m, in the second order differential equation used in 
model M3 turned out to be: 

m, = 1.024, 

ances yw 0 199534 
Referring to Equation 4.2.23, this implies that cumulative production is related to 
time as follows: 

Vila Posee, m+ B, m, (5.3.1) with & =c/(l+ a+b) as in Equation 
4.3.22. The regression of y(t) on the right hand side of this equation using the 
above (but unrounded) values for m, and mor resulted in estimators of the coefficients 
as follows: 

y(t) =-1. 386E5 + 1.040 m, ie B27 (5222) awith: tae ed, 28; BOE 
If t = 220, then cumulative production would i 1,912,000 million barrels, a an 
annual production of 47120 million barrels. Projected over this span of time, model 
mat thus implied some rather unrealistic results. (Production for the last two years 
of this series would be greater than the entire production over the first 110 years). 

In models M5 and M8, there is a significant parabolic trend in the residuals 
with time, which suggests that any projection of production based on these models 
has a built-in bias with time. 

It is interesting that models M4 and M6 each had the maximum residual for 
the year 1968. Thus, model M7 was the only multivariate model in which the 
residuals appeared to be random normal--except for the number of runs of positive and 
negative signs. 

In many models, time (t) was used as a variable. In each case t= 1,2,3,... 
N; for the years 1859 to 1968, N= 110. The notation used for the other variables 


has been outlined in Section One of this Chapter. 
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The intercorrelations between the variables in each multivariate model 
Gre smawnelimlablevoOw oe 2MS Sule Noes. aor aw 5, core AlOwrond(o. oa! 7. 

It would be difficult to pick variables which showed a higher relationship to 
production than these. Even if other (independent) variables were found which 
were valid indicators of the rate of production, there would be problems involved 
in projecting these variables over a long period of time, so as to anticipate the 
forecast for production. 

The regression coefficients for six of the models (M1, M2, M11, M14, 
M19 and M20) could be estimated using linear regression, and the results are 
shown in Table 5.3.18. With the exception of model M19, the residuals in all 
these models had a significant parabolic trend with time, in all the models 
(except M19) the runs were definitely not randomly distributed. Also, the 
residuals were all significantly skewed and peaked, except in the case of model 
M2. 

An estimate of the ultimate resource R was implied by some of these 
models. In model 11, R was calculated as 9150 million barrels. This figure Is 
only a small fraction of the actual production up to 1968; the validity of the 
model is therefore highly questionable. Similarly, model M14 implied an estimate 
for R of 85.27 million barrels. In these cases, the estimate for R was derived 
from the intercept of the regression line. 

For model M19, R was calculated to be less than 411300 million barrels 
(which would be recovered if percent recovery were projected to 100%). In 
model M20, R was calculated to be 130500 million barrels at the point where 
cumulative production reached cumulative discoveries. Thus, among all the 
models described above, M19 was the only one which provided an acceptable 


estimate for R. 
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In all the models discussed below, computer algorithms were required to estimate 
the parameters. It was assumed that if y = o,then Ay = 0 in models M10, M13, 
M16, M18, and M17. Similarly, in model M21, it was assumed that if q =o, 
then y = 0, and in M22 if Y= 0, then cumulative production was assumed to be 
zero. Finally, in models M9, M12, and M15, it was assumed that if t= 0, then 
y =o. In this latter case, a minor adjustment to the models originally proposed 
was necessitated. 

In all the above models, regression through the origin could be employed, 
provided the assumptions made were true. In any case, the probability of the truth 
of the assumption that the intercept (bo) is zero can be estimated by calculating 
the relative improvement in regression if the intercept was not assumed equal to 
zero. 

The conputer algorithm TEST (described in the Appendix) was used to 
minimize the residual sum of squares for the nonlinear models. In each of these 
models we have 

Vai ya| ape (5.0.0) 


where z= z(t) = the observed values of the dependent 
variable, 


z' = z(t) = the values calculated to estimate z, 


e = e(t) = the error term, or residual, and t =1,2,3,...N. 


Because the regression line was constrained to pass through the origin, 
we can write 

z=bxte, cee) 
where x =x/(t) =the values calculated to estimate z, provided 


b =yx(t) z(t) 
SEG r (eer) 


= the regression coefficient for regression through the 
origin. 
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In each of the models, the effect of excluding one regression coefficient 
can be tested by the ratio 

F = (SSE'- SSE)/SSE/(N-2), (5 ar0) 
where SSE" = the residual sum of squares for error calculated 

using regression through the origin, 

SSE = the error sum of squares corresponding to the 

regression equation, 

Zo Det Dit ey, (Seo?) 
and N = the number of observations. This ratio has an F distribution 

with 1 and N-2 degrees of freedom i 

The results for four nonlinear two parameter models (M10, M13, M21 
and M22) are shown in Table 5.3.21. The hypothesis that the intercept is zero 
can be rejected here, except for model M21. 

In order to calculate an F test for the significance of the regression 
through the origin, it is more convenient to use the notation z; for z(t) and x 
for X(t), so that 

Zi Bx] tej (5730) 

=bxj + ef (Seas) 

where f is the true regression coefficient, q is the true error, and the least 
squares regression coefficient is b = (© xj z;) ee 

Since z)— Bx} * <j, 

then YD xj Zj =BDx TDG X; 
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LS e © 
Now SSE' = the minimum sum of squares for error, 
2 


ac 
=) (zj = bx: ? 
2 

BORE Ie tray ead sya) hd 

ece 
Pav braie eal aie dW ay 
= zi? = (> x42: Pee (Seo liZ) 

Also, SSE' =¥ (e;~(b-8)xi? 

= 3 (e; - e.x;}° 
= ei tel, Dx qZep mer 
= Der = ep" x7? (Ogos) 


The problem which remains is to describe the distribution 
Gy Seere 
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H] 


the variance of b. 

Now <,/o has a standard normal distribution, so that (e;)?/o? has a 
chi-squared distribution with one degree of freedom, and (g ? /o® has a chi- 
squared distribution with N degrees of freedom.” 


Similarly ep, /Dx;_* has a standard normal distribution, and eb” Dx;? 
z Sere 
has a chi-squared distribution with one degree of freedom. Thus, using Equation 


5.3.13 above, SSE has a chi~squared distribution with N-1 degrees of freedom. 
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Thus we can write 


E (SSE'/o?)=N-l, (Soe 3) 
Eoge/ (Nel) oe, (5.5216) 
and SSE'/(N-1) =: (ond 


is an unbiased estimator of g?. 


The estimate of the variance of b is 5,2 = 2 /¥D x, (Seo oS) 


b 
- a -_ 
so that ft 2a 8 (5.3.19) 
b 
has a t distribution with N-1 degrees of freedom. 
If we hypothesize that 8 = 0, 
then t? = Beis vs x) (Sez O) 
= (NEI) b2 x. Vosk: 
=< oe 


where F has an F distribution with 1 and N-1 degrees of freedom.* 

The fact that the regression line is restricted to pass through the origin 
results in a small standard error in the regression coefficient b. An alternative 
test of whether the relationship z = bx accounts for the variation in z significantly 
better than the assumption that 


— 


z Sich ta ae 
where x, = |, 
SOlunGieb, = z , the mean of z, is given by the equation 
2 
yl pig aaa (x2 /N) 4) 
2552 (ox? -ax (axe /N)/p 


where s,2 is the residual mean square of z, with N-2 degrees of freedom. This 
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(550621) 


F test has 1] and N-2 degrees of freedom, and may yield more appropriate values 
for F than the simple test of the departure of b from zero-especially if the data 


consist of a cluster of points at some distance from the origin.® 


s 
«By 4s wehoaitne bona W fa al 
rie al A) SoS = Sie 2 eS ater allt te ait itte oat 
ohn al 4 pr d 
; 4 A.-@ a qd = oF 
Wet €) ‘ : ci ine - f : 
- Hbnsy? ta. ze UES sb [- A ailiw sont Bats wad 
Oh, Tedd Bs tear! the of aw” : 
Ela Ge Ce a = (a "el d= 
| | ’ ~ _ i: * 
' 322‘ ‘\ re ey {Uv AAy a 
' : 
4 : - 5, 
2 a : 
» 
i a Se anes ely Ce aa ADEA qtiudnteta m 
iy ut vignp 318 alee eb agi wal —_ wth bart gat edt 
y at ; ; = 
‘ ar 4inl rh bi wy Io +) \ rent of’ “4 ike Paty e » Howe oe} ie josey 


Va. n fie. rr “ Ano, vit 1) Aevecie ad = 5 ShiEg Hylan elt retary i 

_ 2 ak tlt, Sowa it Sortie ‘ot ba oti 

7 a | . 7 3 d= > oT 
7 


} 1, ' sl oe 


ae oTonw 


-nevig at 2 eres oft . 5 2 oat cx 
a _ 


“1 
1a Cohen praise Sis 
| eiaprctn Sion blot 27 ashes 


. . i. 
ida 2s i iti ees Panat fi ata afl e 
. 7 Inhio aif en soot a oa 


7 an 


2184 « 


In all the models, the correlation between the right-hand side and 
the left-hand side of the equation used was highly significant. The F test 
results were not included in the tables of results; it is obvious from the 
correlation coefficient calculated and shown in the results for each model 
that a statistically significant relationship existed. 

Even in model M22 (where the hypotnests that cumulative production 
would be asymptotic to some upper value could be rejected because of the 
estimates obtained for the pararneters) there was a high coefficient of 
correlation between the dependent and independent (calculated) values. 

Thus, sequential data covering over one hundred years implied such high levels 
of significance, it was difficult to choose among the models using the coefficient 
of correlation alone. 

In model M22, values such as 6= 0, 1, and 5 were tried in the equation 
alten faye at 1 Pe ie The smallest sum ef squares for error wos obtained for 
6 =0. The model did not converge to a positive value for Rin any case, and 
so the model could be rejected. This simply highlights the problem of finding 
an independent variable which could be employed to impute an upper limit for 
cumulative production. Cumulative production of crude oil in the United States 
has been increasing at a rapid rate, greater than could be implied by model M22 
using the number of wells drilled as an independent variable. 

With the ebove exception, the three and four parameter models naturally 
tended to yield more desirable results than the models where fewer parameters 
were estimated. This was evidenced in the sum of squares for error, the number 
of runs of positive or negative signs, and the parabolic regression of the residuals 
with time. 

However, in models M9 and M12, the hypothesis that the regression line 
passed through the origin could be rejected at the 0.1% level. (In model M15, 
the rejection of this hypothesis could be made at the 2 1/2% level.) The results 
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from the nonlinear models which had more than two parameters to be estimated 
simultaneously are shown in Tables 5.3.24, 5.6.25, 5-3.20, 5.3.27, Osae20, 
and 5.3.29. Note that the mean of the residuals was not zero in the nonlinear 
models because the regression line passed through the origin. (In ordinary linear 
regression, the mean of the residuals would be zero by definition.) 

These models in which only two or three parameters were to be estimated 
simultaneously presented no special problerns(so far as rapid convergence to 
definite values of the parameters were concerned). However, models M15 and 
M17 (where four parameters including the regression coefficient were estimated 
simultaneously) converged very slowly; as soon as the accuracy of the computer 
algorithm was increased, the parameters tended to drift to quite different values. 

For this reason, computer algorithms used are not recommended for models 
such as M15 and M17. Furthermore in model M17, the values for R and n in the 
equation 

Ay = ky" (Rey)" Cpe) 
tended to increase beyond the capability of the IBM360/67 APL system. Thus, 
it was decided to restrict n to an upper limit of 0.9999. This was in keeping 


with the nature of the original model proposed, where 


Oim<l, ond Ven, 


so that O<t<s, as in equations 4.2.35 and 4.2.35. That is, if 
fe) 
y= dy =ky" (R-y)" (4.2.34) 
dt 
then y (Rey) | dy =kdi (5.3.22) 


so that i : vin (R-y) ” dye I > kdt (6.3.23) 
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if the resource R is produced during the time span 0 <t <s. Now, if we 


put x = y/R, then 


m + nies 1 =m =n s 
R x x (l-x) dx = kdt (553224) 
0 0 


Re is B ((l-m), (l-n) ) =ks (5, 33525) 


It can be seen that if s becomes very large, then R becomes very large, since 
B ((l-m), (I=n) ) is finite, provided m< 1 andn<1. If morn are greater 
than (or equal to) unity, then B( (l-m), (I-n)) =T(1-m) T (l-n) AP (2-m=n) 
may become unbounded in size, or negative, and the equation breaks down. 
For n=23, T(il-n) = -[' (22), and this is infinite. 

Using values for every five years (in order to speed calculations) the last 
optimal values form, n and R calculated by the computer algorithm were m = 0.79, 
n= 23, and R=5.5 x 10° million barrels, with both n and R increasing. Thus, 
any further attempts to obtain a least squares fit with m, n and R allowed to 
vary arbitrarily were discontinued in model M17, and n was set at 0.9999 for 
all other calculations. 

In any of the models where the test of the Durbin-Watson statistic could 
be applied, there was a positive serial correlation at the 5% level of significance. 
For example, at the 5% level of significance, with one Independent variable 
and a sample size of 100, the lower limit for the Durbin-Watson test statistic 
would be 1.65, and any lower value would confirm the hypothesis of positive 
serial correlation. In each model studied herein, this statistic was never 


greater than unity. 
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CHAPTER VI 


SUMMARY, CONCLUSIONS AND RECOMMENDATIONS 


This thesis is concerned with the long term pattern of utilization of crude 
oil. In order to limit the scope of the research to a level appropriate for a study 
of this kind, only one region was investigated, namely the continental United 
States, excluding Alaska. While there was reason to believe that the oil industry 
(which is international in nature, and dominated by eight large firms) could 
best be described using data for a more global area, it was hoped that the present 
study would at least offer a basis for such a project. 

Within the United States, the rapid rate of utilization of oil has captured 
the attention of a number of investigators. Various models of the time pattern of 
oil discovery and of production have been proposed. The main criticism of the 
models based on discoveries has been directed toward the unreliability of such 
data for proved reserves, and for oil in place. On the other hand the major 
argument against using just the statistics on actual production has been that these 
models are arbitrary, and difficult to reconcile with geological information on 
proved reserves and discoveries of oil in place. 

In the present study, a number of models were proposed to project oil 
production. It was supposed that a comparison of the results obtained would offer 
some means for choosing among the models hypothesized. Any residual pattern 
in the error terms could also receive particular attention. 

Even though the models were merely an abstraction of reality, whenever 
possible, the models hypothesized were devised to meet certain boundary conditions 
which were believed to apply at the beginning (and at the end) of the cycle of 
discovery and utilization of oil. In a few cases, the probability that such 
hypotheses were true (as opposed to the alternative hypothesis that they were not 


true for the model) could be estimated. 
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As outlined in Table 5.1.1, a total of twenty-two models were proposed 
to describe the pattern of oil production for the United States during the period 
1859 to 1968. Also, as described in Section One of ithe previous chapter, data 
were not available for some models for the entire period. 

Twelve of the models were linear, and their coefficients could be 
estimated using ordinary linear regression. Ten of the models were nonlinear, 
and the computer algorithms described in Appendix H were devised to estimate 
the various unknown parameters or coefficients. By minimizing the sum of 
squares for error with the data used in a model, the computer algorithms could 
calculate estimates of one, two or three parameters, and also the regression 
coefficient, assuming regression through the origin. All nonlinear models were 
hypothesized in such a way that regression through the origin was necessarily used. 

In each of the twenty-two models, it was apparent that the residuals were 
not random so far as the number of runs of positive and negative signs were . 
concerned. This implies that either none of the models adequately describes the 
long term utilization of oil in the United States, or that there are residual short 
term cycles of highs and lows in production to be accounted for. 

Six of the twelve linear models were multivariate in nature; none of the 
multivariate models (M3, M4, M5, M6, M7 and M8) gave any estimate of the 
ultimate resource, R. Of the remaining six univariate models (M1, M2, M11, 
M14, M19 and M20) only model M19 

y =p.r-/m-b/m Geet) 
imputed an estimate for R which was within the range of 193 to 2319 billion barrels, 
which was set as a rough criterion in Section Two of Chapter Five. 

Except for model M19, the residuals in the univariate models had a 
significant parabolic regression with time. These results, and the results for the 


nonlinear models, have been summarized in Tables 6.1.1, 6.1.2, and 6.1.3. 
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Except for model M7, all models in which some transforrnation was made 
of the dependent variable had residuals with a significant parabolic trend with time. 


However, model M7 was a multivariate model, and did not yield any estimate of R. 


Of the ten nonlinear models, four had one parameter to be estimated (in 
models M10, M13, M21 and M22), four had two parameters to be estimated (in 
models M9, M12, M16 and M18), and two had three parameters to be estimated 
M15 and M17). 

Regression through the origin was assumed for all the nonlinear models; 
in models M13 (the logistic curve for annual production), M22 (cumulative 
production as a function of the number of wells drilled), M9 (the Gompertz curve 
for annual production), M12 (the logistic curve for cumulative production), and 
M15 (the generalized logistic curve for cumulative production), this hypothesis 
(and thus the validity of the model) could be rejected at the 1%, 0.1%, 0.1% 

2% and 2.5% levels respectively. 

This left only one nonlinear model for cumulative production (M21), and four 
models for annua! production (M10, M16, M18 and M17) which could not be rejected 
because of 

i) a parabolic trend of the residuals with time, 
or ii) the regression coefficient bo was not equal to zero. 

As might have been anticipated, these nonlinear models (with one or more parameters 
to be estimated) had a much smaller standard error of the estimate. Models M16 

and M17 had the lowest standard error. In model M16, the sum of squares for error 
was 595300 for 110 observations, versus a sum of squares for error of 604500 in 

model M17 for 110 observations. Using an F ratio to discriminate between these 


two models, 


F = (604500 - 595300) / 595300 / 107 = 1.7 
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This means the probability is greater than 0.75 that model M16 offers a significant 
improvement over its nearest rival (M17), in terms of least squares. As can be 
seen in Table 6.1.2, model M16 has a much smaller standard error of the estimate 
than any of the unrejected models for annual production. For these reasons, 
model M16 (the generalized logistic curve) was a natural candidate to project annual 
production. The resource base R implied by model M16 was 325.2 billion barrels 
of crude oil, using all the data from 1859 to 1968. 

A few additional tests were made using model M146. Using data for the 
even years only from 1859 to 1968, model M16 gave an estimate of R = 331.7 
billion barrels. Using odd years only, R was estimated to be 316.5 billion 
barrels. The first 55 years, from 1859 to 1913 implied an estimate of R= 4.062 
billion barrels; the last 55 years from 1914 to 1968 gave an estimate for R of 
353.9 billion barrels. (The low estimate obtained using the first 55 years can be 
understood from the fact that the entire cumulative production up to 1913 was 
only 3.070 billion barrels, or approximately the same as the annual production 
for 1968.) 

The residuals obtained when model M16 was used with data from 1914 to 
1968 did not have a significantly low number of runs for either the odd years or 
the even years. This suggested that a short term cycle of highs and lows in 
production existed during the period, and that model M16 could not be rejected 
on the basis of any of the tests imposed upon it, as shown in Appendix M. 

However, when data for cumulative discoveries of crude oil were used in 
mode! M16 instead of cumulative production, the estimate obtained for R was 
162.7 billion barrels. This figure was less than the range for R already established, 
and so model M16 could not be used with cumulative discovery data to 


calculate Aq and an implicit estimate of R. 
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As outlined above, models M19 and M21 were the only models for cumulative 
production not rejected because of a parabolic trend of the residuals with time, or 
because by # 0, and which gave an acceptable estimate for R. Model M19 implies 
a resource base of 411.3 billion barrels at 100% recovery. At 80% ultimate percent 
recovery, model M19 provides about the same estimate for R as does M16. Both 
models M19 and M21 for cumulative production had a high standard error of the 
estimate, which reflects the relatively high variation inherent in data on 
discoveries of oil. 

Model M21 was a very coarse model, and was included only to arrive at 
some initial functional relationship between cumulative production and cumulative 
discoveries. Apart from the observed runs of positive and negative signs in the 
residuals, ond a significantly high coefficient of kurtosis, model M21 proved to 
be a reliable model. Perhaps M21 would have been more appropriately proposed 
in the form | 

Y= Bot Bia (6.1.1) 
in order to allow for the fact that q could be greater than zero when y was zero. 
However, the improvement in the sum of squares for error was not appreciable, 
and the estimate of the resource base R relatively unchanged. R= 492.4 billion 
barrels in model M21, and R = 478.7 billion barrels in model M21*. (The ten 
nonlinear starred models are discussed below.) 

In order to determine the effect of restricting the regression line to the 
origin, all ten nonlinear models were reformulated in such a way as to allow B, 


to assume any value in the equations 


y= 2+ 69%), (6.1.2) 
or AY = Bot Byh&), (6.1.3) 
where g(x) was previously defined in the nonlinear models by either 

y = Bg(x), (6.1.4) 
or Ay = Bh{x), (6.1.5) 


because regression through the origin had been stipulated. Models M10, M13, 
M21, M22, M9, M12, M16, M18, M15 and M17 were thus restated as M10*, 
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M13*, M21*, M22*, M9*, M12*, M16*, M18*, M15* and M17*. For example, 
Equation 6.1.1 became model M21* with the introduction of the regression 
coefficient B, .« 

With 8, arbitrary, the determination of the parameters was essentially the 


same as for B, = 0, except that the sum of squares for error, SSE, was given by 


N 2 
SSS 29 OF (aa Sh leh g(x; ) ) i(G3.1-0) 
t=] 
where ze the observed dependent variable (either Ay(t) or y(t)), 
oe the observed independent variable used in a model, 


g(x,) = the value of the observed independent as transformed by the particular 
function used in a model, 


b, = the regression coefficient which was an estimate of B,, 
and —_b, = the regression coefficient which was an estimate of B, . 
Since, =z-bg (Galk7) 
where z = the mean of Zier 
and g = the mean of g(x,), 
b, can be derived from b,. From the normal equations for linear regression with B, , 


b, would be given by 


by = D(z, - Z) (ol) -8) (6.1.8) 


Thus, in the nonlinear equations, the unknown parameters could be estimated in 

the same manner as for the case of regression through the origin. The same computer 
algorithms were used, except that the sum of squares for error was calculated as 
shown in Equation 6.1.6. For regression through the origin the sum of squares for 


error was calculated using Equation 5.3.12 with 
b= bzg(x) (5.3.5) 
2 
(Zol,)) 


where b was an estimate for the single regression coefficient B, as outlined in the 


previous chapter. The computer algorithms used for allmodels are shown in Appendix H. 
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The supplementary results obtained for the ten starred models are given in Appendix K, 
and are summarized in Table 6.1.4. 

When the nonlinear models were formulated so as to use a regression line 
not necessarily through the origin, estimates of the parameters in model M9* (the 
Gompertz curve) were quite different. In the other models this effect was not 
quite so marked. 

The residuals in model M16 had a slightly smaller standard error than the 
residuals for model M16*, which was probably due to the fact that model M16* had 
one extra degree of freedom for error because regression through the origin was 
not specified for the starred models. 

Among these nonlinear models for cumulative production, M15* (the 
generalized logistic) had by far the lowest standard error of the estimate. Model 
M15* implied a resource base estimate of R = 490.6 billion barrels. Using data 
for the 55 years 1859 to 1913, the computer algorithm estimated a value of R= 6.7E31 
million barrels, with the algorithm not converged, and R still increasing. Using 
the last 55 years from 1914 to 1968, M15* gave an estimate for R of 477.2 billion 


barrels. 


This was an interesting result, and model M15* may even represent an 
improvement over model M16, which was favoured earlier. 

However, lacking any way of discriminating between models M16 and 
M15*, considering the problem of estimating the three parameters simultaneously 
in M15*, and finally because M16 met all the boundary conditions set out earlier, 
(which M15 and M15* did not) the reasonable candidate among all the models 
studied would have to be model M16. 

Model M16 gave an estimate for R which was consistent with model M19. 
Model M16 allows for the fact that cumulative production is calculated simply 
by adding annual production for all the years. And finally, at the point in time 
when annual production was zero, so was cumulative production. (It would not 
be necessary to use data for all the earlier years in model M16, and the exact 
time when production was zero is independent of the cycle of increase and eventual 
decline in production which would of necessity occur during the entire process 


of utilization of the resource base.) 
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A comparison of the results obtained in models M16 and M15* for the 
first 55 years (from 1859 to 1913) versus the results obtained during the second 
55 years (from 1914 to 1968) is instructive. It appears that M16 may give a 
conservative, or low, estimate for R, because the estimate for R increases for 
the more recent data. On the other hand, model M15* produces estimates which 
decrease in value when the more recent data is used. Based on these two models, 
the resource base for the continental United States would be within the range of 
353.9 to 477.2 billion barrels. Thus, depending upon one's degree of optimism, 
a figure could be selected within this range as one of the parameters for model 
M16. For example, if a figure of 400 billion barrels were used, only one 
parameter in M16 would have to be estimated, because from Equation 4.2.33 


we could write 
Ay = by(1~-(y/400000)? ), (6.1.9) 


with p a parameter to be determined in this nonlinear equation, and b the 
regression coefficient estimated on the basis of the value determined for p. 

The value of p and R estimated by model M16 was p = 0.2171, and 
R = 325200. Using data from 1934 to 1968, if R was set at a value of 400000, 
p was estimated to be 0.1412; with these values, Equation 6.1.9 can be written 


ek 001412 


Ay © (by/6.18) (6.18 -y Mh; (6.1.10) 


Oo1 412 


because 6.18 = 400000 
This equation gives some notion of the relative effect of the term 

(RP — yP ) describing the decline in Ay. The effect of this term would be small 

at first, when y was small, but would become pronounced as y increased. As 

y increases, the term rapidly tends to zero, because of the value for p. 
Another way of illustrating the relative effect of the term constributing 


to decline in annual production is offered by model M17, where 


Ay =ky™ (R-y)" (4.2.34) 


In this model, the value estimated for m was 0.8289, with the upper limit for n 


set at 0.9999. 
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The value estimated for R was 250.9 billion barrels of oil. Now Equation 


4.2.34 can be restated as 
on Ay = Ink + many + non (R-y) (Cele lit) 


In this expression m and n may be considered as weights in a multivariate equation 
for PnAy, which is linearly related to gry and gn(R~y). Because n was found to be 
greater than m, the (R~y) term contributing to the decline in Ay would be given 
greater weight than the term for cumulative production. 

The foregoing results suggest that annual production must now be reaching 
a maximum. The problem, however, is to find some variable which would serve 
to manifest the convergence of cumulative production to some upper limit, R. 

In model M22, cumulative production did not converge with the cumulative 
number of wells drilled. The following statistics may indicate why this was so. 
The period 1934 to 1968 accounted for over 80% of the cumulative production, 
over 75% of the cumulative discoveries, and over 50% of the cumulative discoveries 
of oil in place. This same period accounted for about 60% of all wells drilled, 
but over 70% of the cumulative footage of wells, - all within the last 35 years 
of the 110 year history of the oil industry. 

An examination of the data for these variables indicates that the annual 
number and annual footage of wells drilled reached a maximum in 1956. On the 
other hand, annual production has usually increased every year - right up to 1968. 
As discussed in Chapter II], annual discoveries of oil in place (classified by 
discovery year) are revised and updated for several years as more information 
becomes available. However, the relative appreciation inherent to these 
estimates soon tapers off, and it is therefore reasonable to assume that annual 
discoveries of oil in place were at their height sometime before 1951. The data 
for annual discoveries of proved reserves, Aq, while not subject to appreciation 
(because these figures are simply new reserves proved from all fields during a given 
year) are quite variable from year to year, but a maximum appears to have been 
reached during the period 1948 to 1951. 

As an example of the generalization of the techniques developed in this 


study, consider a modified version of model M22 to project cumulative discoveries 


of oil in place, cumulative production, and cumulative discoveries. With some 
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benefit of hindsight, we can compare results from the following six cases in which 
cumulative footage of wells is used as the independent voriable, rather than 


cumulative number of wells as in model M22. 


Gee ult) =B (1-e AU), (6.1.12) 
Pehle? ower 
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The following notation was repeated for convenience: 

u = u(t) = cumulative discoveries of oil in place at the end of year t, 
classified by year of discovery. 

y = y(t) = cumulative production of crude oil at the end of year t. 

q = q(t) = cumulative discoveries of crude oil at the end of year t. 


f = f(t) = cumulative footage of all wells drilled at the end of year t. 
(Data on footage of wells drilled was available only for the 
period 1934 to 1968.) 


t= 1,2,3,...35 represented the years from 1934 to 1968 inclusive in 
Case |, Case Il, Case V and Case VI. 


t=1,2,3,...,20 represented the years from 1934 to 1953 inclusive in 
Case Ill. 


t=1,2,3,...,15 represented the years from 1954 to 1968 inclusive in 
Case IV. 


Br Bor B, were the unknown population regression coefficients estimated 
by b, b,, and b, respectively. 
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¢ was a parameter to be determined by the method of least squares, using 
the computer algorithms TEST1 or TEST2 described in Appendix H. 


6 was a correction factor for the initial cumulative footage at the end of 
1933. (As described in Section One of Chapter V, cumulative footage 
up to 1933 was estimated at 2,156,663 thousand feet, using a 
figure of 2500 feet per well drilled.) 6 can be calculated from c, 
b, and b, , as indicated below. 


All the above models are fashioned on a general model of the form 


pet reac eu olny Galle 


where U was the upper limit to which u would converge as f became very large. 
This equation can be restated as 


ee pee (Ge leas) 


Reta, 


and 6=-(n B,/B ) +e 


The two parameters c and 6, and the regression coefficient U in Equation 
6.1.16 could be estimated using the computer algorithm TEST2. Similarly, the 
computer algorithm TESTI] could be used for Equation 6.1.13. However, it was 
simpler to estimate only one parameter c (and the two regression coefficients) in 
the latter equation. The same results were obtained in either case, provided 6 
was allowed to be arbitrary. Therefore, the parameter c, and the various coefficients 
(which were dependent upon c) were estimated using the algorithm TEST] for 
Case Il, Case Ill, Case IV, Case V and Case VI. Some results from all six cases 
have been summarized in Table 6.1.5. More detailed results are contained in 
Appendix L. 

The results for Case Ill and Case IV indicate that these models for 
cumulative footage to predict cumulative discovereis of oil in place cannot be 
rejected, except for the fact that the residuals were serially correlated at the 5% 
level of significance. However, the Durbin-Watson test statistic indicated a 
positive serial correlation among the residuals in every model where the test 
could be applied. It was assumed that some short-term pattern of highs and/or 
lows existed in the data for y, q, u and f, and inasmuch as the Durbin-Watson test 


did not serve to distinguish the better models in any case, this test was rejected as 
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TABLE 6.1.5 


SOME RESULTS FROM PROJECTIONS OF CUMULATIVE DISCOVERIES OF OIL 
IN PLACE, CUMULATIVE PRODUCTION AND CUMULATIVE DISCOVERIES, 


FROM CUMULATIVE FOOTAGE OF WELLS DRILLED 


Estimates of the regression coefficients and the correction factor, 6 


Case b b, h, fon 


—— 
——_ —a 


| 442.7 


(This model could be rejected because of a parabolic regression of the residuals 
with time significant at the 0.1% level.) 


| 393 .0 -807 .2 -1123 


(This model could be rejected only because the number of runs of signs of 
the residuals was not random at the 1% level of significance.) 


HI » 008.0 -851.2 -1164 
IV S777 -594.0 = fone, 
V WEES =e lise -1044 


(This model could be rejected only because the number of runs of signs of 
the residuals was not random at the 0.1% level of significance.) 


Vi Hel bee” =205.5 -1199 


(This model could be rejected only because the number of runs of signs of 
the residuals was not random at the 1% level of significance.) 


Note: All the above figures should be multiplied by a factor of 1000 
to make them consistent with the data used. Thus, the regression 
coefficients given above are in units of billions of barrels, and 
the correction factor 6 shown is in units of millions of feet. 
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a criterion for choosing among the long term models. 

In Case Il, Case II] and Case IV the estimates of ultimate cumulative 
discoveries of oil in place (U) was 393.0, 388.0 and 397.7 billion barrels of 
oil. The ultimate recoverable resource of oil (R) would be less than the 
ultimate discovery of oil in place. However, the annual discoveries of oil 
in place should have been appreciated to their estimated ultimate value, as 
discussed in Chapter II! for annual discoveries of proved reserves. Lacking 
any data to do this, and because estimates of annual discoveries of oil in 
place appreciate to approximately their ultimate value in about five years or 
so, then on the basis of the results from Case Ill and IV, the upper limit for R 
could be set at 400 billion barrels oil. As can be seen from Table 6.1.4, this 
criterion for R would result in the rejection of the nonlinear models M10, M10*, 
M21, M21*, M9*, MIS and M15*, apart from any of the other tests imposed on 
these models. Since all these models had either been pre-empted by other models 
with a lower sum of squares for error, or had been rejected on the basis of some 
other criterion, and had no special redeeming features in any case, their 
rejection forthwith presented no problem. Models M16, M16*, M17 and M19, 
and Case V were therefore the only five of all the models proposed which implied 
an acceptable value for ultimate cumulative productive. 

While the models for Case V and Case VI could be rejected because of 
the small number of runs of signs among the residuals, the results obtained as 
an estimate of the upper bound of the recoverable resource R was instructive. In 
Case V, using cumulative production, the upper limit for y would be about 380 
billion barrels if f became very large. The standard error of the estimate was 
much less than for model M19 - the only other model for cumulative production 
not rejected vecause of the value ecimenee for R. However, in Case VI, the 
upper limit for q would be about 160 billion barrels if f became very large, and 


so this model could be rejected on the basis of this value estimated for R. 
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The relationship which had been assumed in this study between y and q 
is given by Equations 4.5.3 and 4.5.4 in models M20 and M21, respectively. 
While model M21 appeared to be among the better of all the models proposed, 
the relationship between y and q apparently may not be expressed just as a 
simple power function. Model M20 was rejected because of a parabolic 
regression of the residuals with time. Also, if we assumed that cumulative 
production followed a similar pattern to cumulative discoveries in time (or 
footage of wells drilled), and therefore tried cumulative discovery data in a 
model used for production, then a value for R was obtained which was less than 
the lower limit for R which had been established. The nature of the relationship 
between y and q was thus not revealed by the models used in this study. 

The above results point out the need to establish a more functionally 
consistent set of models linking the main variables y, q, u and f. 

This suggests the necessity for a different set of relationships between 
the main variables y, q, u and f used in this study. In Figure 2.3.1, the 
interrelationship between some of the factors in the discovery and utilization 
of oil is implied. This relationship can be restated in a simpler form, as follows: 
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2 See Se 


| (6.1.17) 
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where, for example, f > u means the amount of oil in place, u, is implied by 
cumulative footage of wells drilled, f. The model for Case II] and IV describes 
this relationship reasonably well. 
To model the relationship between cumulative production and cumulative 
footage of wells drilled, Case V might be restated as 
jaa oOe sae Lia ; (6.1.18) 
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where F = the upper limit of the cumulative footage of wells drilled, and which 
would be reached if y = R= , (less one barrel, say), the upper limit 
of cumulative production. 


In Case Ill we had 
f 


Ur UGte pres: (6.1.20) 
which can be restated as 

f= F+ = on (U-v) (6.1.21) 
Combining this equation with Equation 6.1.19 results in 

a (R-y) = con (U-v) (Gal 22) 

or y =R-(U-u)/ (6.1.23) 


Thus if the models for Case Ill and Case V are correct, then model M7 might be 
improved by incorporating the relationship implied by Equations 6.1.23 and 
6.1.19 into the model. It would be of interest to test this hypothesis. Also, 
because model M20 was rejected, model M7 could be improved further by 
substituting a more adequate relationship between y and q. 

For this purpose, it would probably be worthwhile to first study the 
nature of the relationship between f and q. The number of barrels of proved 
reserves Aq discovered by Af footage of wells, (or conversely, the footage 
of wells drilled as a result of Aq new discoveries) should be examined. 

More than likely, it would be worthwhile to separate total footage 
into wildcat footage and development footage in such an investigation. There 
probably should be close relationship between wildcat footage and discoveries 
of oil in place, whereas development footage should be related to production, 
rather than to discoveries of oil in place. Also, if discoveries of new proved 
reserves, eee be separated into additions to reserves from new fields, 
and additions to reserves as a result of development of fields already discovered, 
then improved models might be formulated, and proven superior to the models 
for q considered in this study. 

Another way of inhancing the models in such a way as to make the 


parameters more meaningful, would be to put footage drilled into the units 
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of wildcat footage drilled per cubic mile of oil bearing sediments. Similarly, 
data for discoveries of oil in place could be expressed as cumulative discoveries 
of oil in place per cubic mile of oil bearing sediments. The parameters found 
for various regions would probably have great value to interpret differences 
between various geological basins. 

The data for the above measures were not on hand for this study, and 
so could not be incorporated into any model. Outside the United States and 
Canada, such figures would probably not be available. However, it was 
planned to try some models as described above with the data for Alberta, 
British Columbia, Saskatchewan and Manitoba in a separate study. Some of 
these models could probably be improved by appreciating discovery data 
(classified by year of discovery) to their ultimate values, as indicated in 
Chapter IIl, but this was beyond the scope of the present study. 

Meanwhile, the value for R obtained by using production figures only 
in model M16 was consistent with the results obtained from Case III and IV. 
When production data for the years 1914 to 1968 were used to determine the 
parameters, and production for the years prior to 1914 was estimated based 
on these parameters, the calculated values approximated the actual values 
reasonably well (except that anneal production approached but never reached 
zero in this model.) The calculated and actual values are shown in Table 
6.1.6 for odd years only back to 1857. 

In order to calculate the parameters necessary for a projection of 
future production, a slight modification of model M16 to MI6S was necessary, 
as follows. Cumulative production at the end of year t simply became 
cumulative production for the beginning of year t+ 1. Using data from 1913 
to 1967 for cumulative production, and 1914 to 1968 for annual production in 
model M16S led to the results shown in Appendix M. By using the parameters 
thus estimated from model M16S, production could be projected from the year 
1968 onward. This projection is shown in Table 6.1.7 at five year intervals 


up to the year 2020, and at twenty year intervals after that. Maximum production 
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TABLE 6.1.6 


ESTIMATED AND ACTUAL ANNUAL PRODUCTION FROM 1857 TO 1913 
IN THE UNITED STATES, USING MODEL M16 AND DATA 
FOR THE YEARS 1914 TO 1968 


ANNUAL PRODUCTION ANNUAL PRODUCTION 
Year Estimates Actual Year Estimated Acival 
1857 0.44 0 1887 22 28.3 
1859 0.59 0.002 1889 27 35.2 
1861 0.78 2.1 1891 34 54.3 
1863 1.0 2.6 1893 42 48.4 
1865 1.4 2.5 1895 52 52.9 
1867 1.8 3.3 1897 64 oO 
1869 2.3 4.2 1899 78 5) 
1871 3.0 5.2 1901 95 69.4 
1873 4.0 9.9 1903 110 100.5 
1875 ae 8.8 1905 140 134.7 
1877 bh6 13.4 1907 160 166.1 
1879 g 19.9 1909 200 183.2 
1881 iW 27.7 1911 230 220.4 
1883 14 23.5 1913 270 248.4 
1885 17 Hs Mapai (el weer) 3069.902 3069.902 


Estimated figures have been rounded to two-figure accuracy. 


Actual figures are from Appendix B. 
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VARUECOR \a7 


ESTIMATED PRODUCTION FROM 1970 TO 2080 IN THE UNITED STATES, 


a 


ANNUAL CUMULATIVE 

YEAR PRODUCTION PRODUCTION 
1970 3000 89500 
1975 3100 105000 
1980 | 3200 121000 
1985 3200 137000 
1990 3200 153000 
1995 3100 169000 
2000 3000 184000 
2005 2900 199000 
2010 2700. 214000 
2015 2600 227000 
2020 2400 239000 
20.40 1700 280000 
2060 1100 308000 
2080 680 325000 


Estimates of annual production rounded to two-figure accuracy. 


Estimates of cumulative production rounded to three~figure accuracy. 
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was calculated to occur in the year 1985 (which was seven years prior to the year 
of the maximum rate of 3300 million barrels annually predicted by C.L. Moore 
using the Gompertz curve, and seventeen years after the maximum of 2900 million 
barrels predicted by M. King Hubbert using the logistic See 

The ultimate recoverable resource estimated by model M16S was 352.0 
billion barrels of oil if t became very large. Weeks estimate of primary recovery 
of crude oil and natural gas liquid was 270 billion barrels, and production from 
secondary recovery was estimated to eventually reach 75% of production from 
primary recovery for total world production. Mies As discussed in Chapter III, 
Hubbert had estimated the resource base at 170-175 billion barrels, while the 
value Moore estimated for R in the Gompertz equation was 330 to 381 billion 
barrels, - depending upon the date the estimate was made during the period 
1960 to 1966. 

The remarkable property of model M16 is that is describes the pattern 
of production very well for the years 1914 to 1968 - a period encompassing 
two world wars and a major aepression. Throughout this entire period, a 
constant effect appeared to be manifest in model M16 - that annual production 
was jointly dependent upon cumulative production, and the apparent amount 
of resource remaining. 

This feature of model M16, and of M17,suggested the rveans for 
generalizing the models. If the value of the ultimate resource of oil, R, is 
known, the models can be simplified considerably. The results from Case III 
and IV imply that the ultimate resource of oil in place can readily be estimated 
using data for only a few (15-20) years in a fairly well explored area such as 
the United States. Model M16 can then provide on estimate for R, the ultimate 
amount of oil which may be recovered. If the results from the models for 
Case II, Case Ill and M16 are consistent, then the estimated value of R could 


be substituted in the equation for model M15, and we have 
Ay =ky™ (R-y)" (4.2.34) 


with R known. 
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This equation can also be stated as 


On Ay = nk+m—nytnoyn(R-y), (6.1524) 


or in AY = By * By ony + Ban (R-y) (6.1.25) 


If R is given, then the last equation is in the familiar linear multivariate form. 


It would therefore be possible to include as many other variables V, , Va, Va,+++ 


as may be desired, as follows: 


lea Aye baat Bina) a. tna) ct aoe 
+ Bru myt BL on(R-y), (6.1.26) 


where fn(V; ) indicates any trnasformation of the data for V. (such as mV; iy 


exp (V;), etc.). Estimates of the k+1, unknown regression coefficients Rj, By, 


“A A 
Bo r+++ 8, could readily be obtained; the coefficients 6, and 8, would thus 


assign a weight to the terms gny and g(R-y) according to their relative predictive 


value. 


The independent variables which could be included for consideration in 


this fashion could certainly include those suggested in Section Six of Chapter Ill. 


A few of the variables which come to mind include: 


Q 
— 


population, 

total energy demand, 

cost of domestic oil, 

cost of imported oil, 

restrictions on oil imports, 

national emergency and security considerations, 
tax incentives, 

production of other fossil fuels, 

uranium production and imports, 

technological developments, such as in secondary recovery, 
volume of oil bearing rocks found, 

productive nature of various basins, 

implications of North Sea and Artic developments. 
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The above list was meant to be illustrative only, and not an exhaustive summary 


of all factors influencing oil production. However, in order to use these variables 
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in a projection, the values of each variable would have to be estimated for 
each point in time over which the series of production figures were to be 
estimates. 

Another approach that should probably be tried would be to use one 
of the models studied to describe the time pattern of the consumption of oil. 
A reasonable candidate would be model M168, or a model similar to Equation 
6.1.26, but with domestic consumption of all oils as the dependent variable. 
Such a model could then provide an estimate of the anticipated difference 
between domestic supply and demand for oil in the United States, and this 
information would be of great value to an exporting area such as Western Canada. 

On the basis of the foregoing discussion, a number of models of the 
availability and consumption of oil could be postulated. It was hoped that the 
present study may serve as a basis for such models, and provide the framework 


against which they could be tested. 
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APPENDIX A 


CRUDE OIL PRODUCTION, PROVED RESERVES, AND 1968 ESTIMATE 
OF ORIGINAL OIL IN PLACE BY YEAR OF DISCOVERY IN THE UNITED STATES 
EXCLUDING ALASKA 


1920 - 1968 
Millions of Barrels 
| ll HI IV 
VEAR ANNUAL PROVED ORIGINAL OIL IN PLACE 
ad PRODUCTION RESERVES: BY YEAR OF DISEGOVERY 
Pre- 1920 4987 c C5270. 6 
1920 445 a 7200 b 8627 
1921 474 7800 11653 
1922 559 7600 4442 
1923 733 7600 3302 
1924 AS 7500 3080 
1925 764 8500 4044 
1926 77\ 8800 13159 
1927 902 10500 5457 
1928 901 11000 7897 
1929 1008 13200 9880 
1930 897 13600 18655 
1931 849 13000 6538 
1932 784 12300 2413 
1933 905 12000 Ali7 
1934 906 12177 7774 
1935 992 12400 827] 
1936 1087 13063 1917 
1937 1265 15507 8941 
1938 1202 17348 1047] 
1939 | W258: 18483 542] 
1940 1335 19025 8776 
1941 1389 19589 6494 
1942 1374 20083 4896 
1943 1493 20064 4453 
1944 1669 20453 b 7582 


* Figures for proved reserves from 1920 to 1944 include condensate. 


+ SRT Ge a | 
ap {40 Y ee it ; 
| 


“wyar cal sled 
ileal Fo cea 7 
ti | 
; rail Ay Cm 7 APs 
| Pay, vOKTWoOM 


> NHR 


ie 
1s teh . 
‘ . > 


fAWEN R oe) cae 
rien : een 7 - 
He BIN | 
sa ak 
ay Ain 9 ; 
rehe iChat we 
cont emt ine. 
anne hs COM 
SAGE! Gas Ves 
Uae 


sia at ose mont ay = 
ae: é 
ie | 


AO x 


| I Hl IV 


YEAR ANNUAL PROVED ORIGINAL OIL IN PLACE 
PRODUCTION RESERVES* BY YEAR OF BISCOVERY 
1945 1705 19942 a 6539 
1946 1728 20874 4943 
1947 1851 21488 5749 
1948 2003 23280 9108 
1949 1825 24649 14405 
1950 1952 25268 7160 
1951 2213 27468 5391 
1952 2258 2796) 4854 
1953 2315 28945 7546 
1954 2274 29561 6380 
1955 2427 30012 5196 
1956 2560 30435 5592 
1957 2560 30300 5802 
1958 2384 30536 Bo07 
1959 2495 31716 2602 
1960 2473 31586 2615 
1961 2503 31674 1706 
1962 2543 31305 2528 
1963 2602 30895 1543 
1964 2634 30908 2494 
1965 2688 SN ICZS 1781 
1966 2850 Sli Wiss 
1967 3017 30996 1542 
1968 3058 a 30334 a 784 a 
Total. 0007) 386558 
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REFERENCES: 
Columns i), lil, dnd lV: 
(a) American Petroleum Institute, Reserves of Crude Oil, Natural Gas 


Liquids, and Natural Gas in the United States and Canada as of December 
31, 1948, Volume 23, Washington, D.C., p. 29 and p, 31. 


—— SS 


Dallas, Texas, 1968, p. I7. 
(c) See Appendix B. 
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APPENDIX B 
CRUDE OIL PRODUCTION AND ESTIMATED CUMULATIVE 
DISCOVERIES IN THE UNITED STATES, 
EXCLUDING ALASKA 
1859 - 1919 


Millions of Barrels 


ESTIMATED 

ANNUAL CUMULATIVE 
YEAR PRODUCTION DISG@V ERTIES 
1859 Os002 4a 
1860 ORS 1522.0 
1861 De 152 
1862 gail 152 
1863 20 153 
1864 pA) 166 
1865 Ze 224 
1866 £) 08) 225 
1867 SEO 231 
1868 orto) 246 
1869 1G 286 
1870 eye) 336 
187] Da2 749 
1872 (oye) 750 
1873 9.9 750 
1874 10.9 766 
L875 8.8 790 
1876 | 804 
1877 13.4 821 
1878 1oe4 878 
1879 19.9 1364 
1880 2640 1365 
1881 277, 1365 
1882 30.4 1400 
1883 Davo 1403 
1884 Dae 1420 
1885 Zieg iW7s 
1886 28.1 1880 
1887 28.3 1973 
1888 2726 1983 
1889 ops 2006 
1890 45.8 2038 
1891 54ao 2077 
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ESTIMATED 

ANNUAL CUMULATIVE 
YEAR PRODUCTION DISCOVERIES 
1894 49.3 2574 
1895 O26, 2613 
1896 61.0 2650 
1897 OES) 275 | 
1898 55.4 2755.50 
1899 D7) 3146 b 
1900 63.6 3304 
1901 69.4 3915 
1902 88.8 4021 
1903 100.5 444] 
1904 7a 4649 
1905 oA? 5014 
1906 126.5 528) 
1907 166.1 5340 
1908 178.5 5/62 
1909 183.2 618] 
1910 209.6 6305 
1911 22074 6847 
1912 222.9 7401 
1913 248.4 7647 
1914 265.8 7799 
1915 Zee! 8227 
1916 300.8 8778 
1917 BSaeo 9279 
1918 SoD ae 9606 
1919 373.40 d 9944 b 
Total 4987.2 b 
REFERENCES: 
Column II 


fa) World Oil, Vol CXLV, Auster 15 11957) panics. 
(b) Total for the Column. 


Column III 

(a) C. L. Moore, Methods for Evaluating U.S. Crude Oil Resources and 
Projecting Domesie tic Crude Oil il Availability, United States Department 
of the Interior, Office of Oil and Gas, 1962, pp. 47, 94 and 99. (The 
recovery rate prior to 1930 was 15%, ana tie 1945 recovery factor was 
20.417%, as estimated by Moore. Cumulative discoveries have thus 
been adjusted by a factor of (I5 = 20.417), to convert to recovery 
conditions as of the year the discoveries were reported. ) 


(b) Ibid, p. 100. 
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APPENDIX C 


TOTAL WELL COMPLETIONS IN THE UNITED STATES, 
EXCLUDING ALASKA 


1859 - 1968 

ie [ie 
YEAR ANNUAL WELL COMPLETIONS 
1859 GF) G 
1860 240 
1861 247 
1862 385 
1863 595 
1864 Talc? 
1865 1277, 
1866 1,142 
1867 1,256 
1868 1,064 
1869 1,682 
1870 1,639 
1871 1,452 
1872 1,166 
1873 1,244 
1874 1,304 
1875 2,458 
1876 2,960 
1877 3,956 
1878 2.988 
1879 2,817 
1880 4,216 
1881 3,876 
1882 3,271 
1883 3,018 
1884 2,365 
1885 2,955 
1886 4,308 
1887 2e\A2 
1888 2,286 
1889 6,697 
1890 8,831 
1891 3,207 
1892 3,878 
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YEAR ANNUAL WELL COMPLETIONS 
1894 7,649 
1895 13,250 
1896 14,017 
1897 9,815 
1898 8,897 
1899 14,353 
1900 17,077 
1901 15,097 
1902 16,474 
1903 19,169 
1904 20,545 
1905 16,566 
1906 18,830 
1907 19,668 
1908 16,909 
1909 18,327 
1910 14,940 
191] 13,768 
1912 17,180 
1913 25,590 
1914 23,137 
1915 14,157 
1916 24,619 
1917 23,407 a 
1918 25,696 b 
1919 29,383 
1920 34,081 
1921 21,985 
1922 24,658 
1923 24,438 
1924 217074 
1925 25,623 
1926 29,301 
1927 24,141 
1928 22,331 
1929 26 ,356 
1930 21,240 
1931 12,432 
1932 15,021 
1933 Pay nl 
1934 Lop 7 


1935 21,420 


a) 


nN ae 


YEAR ANNUAL WELL COMPLETIONS 
1936 25 ,888 
1937 32,560 
1938 27,149 
1939 27 ,340 
1940 30,040 
1941 32,140 
1942 18 A150 
1943 19477 
1944 20,202 
1945 26,879 
1946 29,228 
1947 33,122 
1948 39,778 
1949 39,038 
1950 439279 
1951 44,516 
1952 AD pO2) 
1953 49,279 
1954 53,930 
1955 56,682 
1956 58,160 
1957 537838 
1958 49,107 
1959 Sa 47, 
1960 46,728 
1961 46,914 
1962 46,139 
1963 | 43,628 
1964 45,215 
1965 4] ,404 
1966 oy sre 
1967 33,744 
1968 327807 6b 


TOTAL aes 2,189,042 


American Petroleum Institute, Personal communication, Washington, D.C., 


January 22, 1970. 
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DeGolyer and MacNaughton, Dallas, Texas, 1969, p. 31. 
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APPENDIX D 


FOOTAGE DRILLED, NEW WELLS, IN THE UNITED STATES, 
EXCLUDING ALASKA 


1934 - 1968 
I. 
YEAR ANNUAL WELL FOOTAGE 
(Thousands of Feet) 

1934 49,804 
1935 61,523 
1936 77072 
1937 99,923 
1938 87,497 
1939 837,829 
1940 91,870 
1941 95 ,387 
1942 64,813 
1943 58 ,504 
1944 80 ,838 
1945 92,982 
1946 101,125 
1947 27 Si6 
1948 136,709 
1949 138,617 
1950 159,761 
1951 176,757 
1952 188,393 
1953 198 ,432 
1954 211,296 
1955 228 ,530 
1956 2304000 
1957 220 , 864 
1958 196,507 
1959 206 ,057 
1960 194,402 
1961 191,973 
1962 197,833 
1963 185,729 
1964 190,794 
1965 178,034 
1966 167,612 
1967 146,266 
1968 __ 148,923 

Total 5,056, 884 


REFERENCE: oe 
DeGolyer and MacNaughton, Twentieth Century Petroleum Statistics, 1969, DeGolyer and 
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APPENDIX E 


DOMESTIC DEMAND FOR ALL ‘REFINED OILS AND FOR 
GASOLINE IN THE UNITED STATES 


1919 - 1968 
(Millions of Barrels) 


YEAR POLES GASOLINE 
1919 B75 89 
1920 457 109 
121 A458 117 
1922 531 138 
1923 652 174 
1924 688 197 
1925 OME 233 
1926 781 268 
1927 803 305 
1928 861 Soy, 
1929 94] 383 
1930 927 398 
121 903 408 
1932 835 378 
1933 868 380 
1934 920 A410 
1935 984 435 
1936 1093 482 
1937 1170 B19 
1938 1137 523 
1939 1231 556 
1940 1327, 589 
194] 1486 668 
1942 1450 589 
1943 1521 568 
1944 167] 632 
1945 es: 696 
1946 W793 735 
1947 1990 v95 
1948 ZA. 87] 
1949 2118 914 
1950 28795 994 
1951 2570 1090 
1952 2664 ey? 
1953 2775 1206 
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YEAR ALL OILS GASOLINE 
1955 3088 1334 
1956 CYAIS) 1373 
1957 3219 1393 
1958 3315 1436 
1959 3450 1485 
1960 3536 Tol 
196] 3079 1533 
1762 3735 1583 
1963 3854 1635 
1964 3959 1658 
1965 4125 1702 
1966 4325 1793 
1967 448] 1843 
1968 4788 W235 
REFERENCE: 


DeGolyer and MacNaughton, Twentieth Century Petroleum Statistics, 1969, 
DeGolyer and MacNaughton, Dallas, Texas, 1969, p.p. 46-47. 
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APPENDIX F 


CRUDE OIL PRODUCTION IN NORTH AMERICA 
FOR VARIOUS REGIONS 


1862 - 1968 
( Thousands of Barrels ) 


| T HII IV V Vl 
YEAR CANADA ALBERTA ALASKA MEXICO CUBA 
1862 12 
1863 83 
1864 90 
1865 110 
1866 175 
1867 190 
1868 200 
1869 220 
1870 250 
1871 270 
1872 308 
1873 365 
1874 169 
1875 220 
1876 312 
1877 312 
1878 312 
1879 575 
1880 350 
1881 275 
1882 275 
1883 250 
1884 250 
1885 250 
1886 584 
1887 526 
1888 695 
1889 705 
1890 745 
189] 755 
1892 780 


i. 
QDIKIM AARALA ADM BRIA 


=a A125 


| II II IV V Vi 
YEAR CANADA = ALBERTA ALASKA = MEXICO CUBA 
1894 829 
1895 726 
1896 727 
1897 710 
1898 758 
1899 808 
1900 913 
1901] J37 10a 
1902 01 AO 
1903 487 75 
1904 553 126 
1905 634 251 
1906 569 502 
1907 789 1005 
1908 528 3933 
1909 421 2714 
1910 316 3634 
1911 291 12553 
12 243 16558 
Tes 228 25696 
1914 2S 26235 
1015 215 32911 
1916 198 40546 
11Z, 2144 50293%d 
1918 305 b 63828 b 
1g, 241 87073 
1920 196 157069 
192] 188 193398 
1922 179 182278 
W722 170 Zea 149585 
1924 16] Dib 139678 
1925 332 172 Usps 
1926 364 2 90421 
1927 477 326 6412] 

1928 624 492 50151 

1929. ale O72 44688 

1930 1522 1445 37030 

1931 1543 1444 33039 

1932 1044 918 32805 

1933 1145 827 34001 23h 
1934 1417 852 38172 28 
1935 1447 767 40241 A7 
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| I II IV V VI 
YEAR CANADA ~— ALBERTA ALASKA = MEXICO CUBA 
1937 2944 2140 46907 33 
1938 5966 6212 38506 78 
1939 7838 TPA 42898 112 
1940 859] 8221 44036 142 
194] 10134 9616 42196 150 
1942 10365 9834 34815 ey 
1943 10052 9213 35163 107 
1944 10099 8341 38203 109 
1945 8483 7643 43547 149 
1946 7586 6704 49235 269 
1947 7692 6382 56284 300 
1948 12287 10505 58508 hy, 
1949 21305 19768 60910 206 
1950 29044 27149 72443 156 
173) 47615 45836 V7 oiNz 128 
1952 61237 58837 TS) 36 
1953 80899 76702 72440 We 
1954 96080 87593 83653 YS 
as 129440 112853 89406 S75 
1956 171981 143682 90660 543 
1957 181848 136766 88266 395 
1958 165496 112471 G5930 344 
159 184778 128802 96393 192 
1960 189534 130499 99049 108 
196] 220861 157766 6327 a 106784 80 
1962 244139 165098 10259 111830 90 
1963 258435 168670 10740 114867 UP 
1964 274626 175089 11059 115576 264 
1965 292308 153727, 11128 La WAC) 264 
1966 320467 202502 14358 121149 260 
1967 Shey |7asy/ 2ZO05)7 29126 133042 756 
1968 417691 b 250675 b 66203 a 141960 b 146 a 


ee ee ee ee oe eee 


TOTAL 3880151 2716291 159200 4313508 6376 
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REFERENCES: 
Column I: 
(a) G. H. Barrows, International Petroleum Industry, Volume 1, 


International Petroleum Institute, Inc., New York, 1965, p. 270. 


(b) De Golyer and MacNaughton, Editors, Twentieth Century 
Petroleum Statistics , 1969 (Annual), De Golyer and MacNaughton, 
Dallas, Texas, 1969, p. 5. 


O44 
Pass 
Sie 
eis 
Orhex 
Gent 
Mobos 
O&we 
cosh) 
ceecy 
Es igre 
QhOre 


a aL, 


Oeste 


aber 
GVA? 
eee | 
-. SReees 
aytas! 
(Sent 
Stes! 
Tweed 
oomTat 
aespet 
ONesa! 
OBde<? 
eprant i 
S7es04 


gquggugageteserey 


os 


PEStaeEes 


-Al4- 


REFERENCES cont'd 


Column III: 

(a) Petroleum and Natural Gas Conservation Board, Alberta Oil 
and Gas Industry, Calgary, Alberta, 1952. (Cumulative production 
up to and including 1922 was given at 63,000 barrels. ) 


(b) Oil and Gas Conservation Board, Cumulative Annual Statistics, 
Report 69 - 17, Oil and Gas Conservation Board, Calgary, Alberta, 
1968, 9. 305. 


Column IV: 


(ae De Golyer and MacNaughton, p. 23. (Data for Alaska was included 
with Nevada prior to 1961.) 


Column V: 
(a) G. H. Barrows, p. 274. 


(b) De Golyer and MacNaughton, p. 5. 


Column VI: 
(a) De Golyer and MacNaughton, p. 5- 
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APPENDIX G 


WORLD CRUDE OIL PRODUCTION 


1857 - 1968 


(Millions of Barrels) 
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| | HI IV 
NORTH SOUTH 1OTAL 
YEAR AMERICA AMERICA WORLD 
1926 861.700 C7 Nov 1097 .000 
1927 965.700 102.800 1263 .000 
1928 952.200 155%500 1375000 
1929 1053.100 190.800 1486 .000 
1930 939.100 189,500 1410,000 
1931 885.700 168,200 1373 .000 
1932 819.000 167.800 1310.000 
1733 940.800 169.100 1442 .000 
1934 947.700 196.500 1522 .000 
1730 1038.000 210.800 1654.000 
1936 1142.000 221.700 1792 .000 
1937 1329.000 258.400 2039 .000 
1938 1260.000 262.900 1988 .000 
1939 1316.000 284.200 2086 .000 
1940 1406 .000 268.800 2150.000 
1941 1455 .000 309.100 2221.000 
1942 1432.000 220-5200 2093 .000 
1943 1551.000 297 7300 2297000 
1944 1726 .000 343.400 2592) 000 
1945 1766.000 406.500 2595000 
1946 1791 .000 466.700 2745 .000 
1947 1921 .000 5175000 3022 .000 
1948 2091. 000 574.900 3433 .000 
1949 1924.000 573.400 3404 .000 
1950 2075.000 644.100 3803 .000 
1951 723732000 726.700 4283 .000 
1952 2428 .000 766.200 4505 .000 
1953 2510.000 756.300 4798 000 
1954 2495 .000 809.700 5017.000 
1955 2704 .000 910.700 5626 .000 
1956 2880.000 1037.000 6125.000 
1957, 2887 .000 1169.000 6438 .000 
1958 2708 .000 1121.000 6608 .000 
1959 2856 .000 1204,000 7145.000 
1960 2264 .000 1266.000 7690 .000 
1961 2949 000 1319.000 8184,000 
1962 3032 .000 1439.000 8882 .000 
1963 3126 .000 1468000 9538 000 
1964 3177000 1531.000 10309.000 
1965 3259.000 1564 .000 11058 .000 
1966 3470.000 1549.000 12022 .000 
1967 3701 .000 1650.000 12913 .000 
1968 3889.000b 1684,000b 14077 ,000b 
TOTAL 95, 902.224 oe 210,782.577_ 
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REFERENCES: 
Column Il: 
(a) G. H. Barrows, International Petroleum Industry, Volume 1, 


International Petroleum Institute, Inc., New York, 1965, 
pp. 268 - 278. (Totals for Canada, Mexico, and United States. ) 


(b) De Golyer and Mac Naughton, Twentieth Cent tury Petroleum 
Statistics, 1969, De Golyer and ! Mac Naughton, Dallas, Texas, 
G9 es 

Column III: | 

(a) G. H, Barrows, pp. 268 = 278. (Totals for Argentina, Bolivia, 


Brazil, Chili, Columbia, Ecuador, Peru, Trinidad, and 
Veer ) 


(b) De Golyer and Mac Naughton, p. 4. 
Column IV: 

(a) G. H. Barrows, p. 268. 

(b) De Golyer and Mac Naughton, p. 4. 


Note: Most of the data for this Exhibit have been rounded to four figures; 
extra zeros have been added merely for ease in reading. 
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APPENDIX H 


COMPUTER PROGRAMS TO MINIMIZE THE SUM OF SQUARES 
BETWEEN THE OBSERVED AND CALCULATED VALUES 


M is a tx2 matrix with the observed (dependent) values z=z(t) in the first 
column, and observed (predictive) values x=x(t) in the second column. The 
invoked function FN calculates the values z' = bf(x; u, v,w)=bx which are an 
estimate for z_ in the case of regression through the origin. 

TEST 3 is an APL\360 computer program which seeks out the values of the 
parameters u, v and w which minimizes the sum (z~z')?. The accuracy of the 
parameters can be specified by putting DU, DV and DW in the vector A to be as 
small as may be desired. 

TEST 3 indicates the particular direction of the search(i.e. -- whether u,v 
and/or w increase or decrease) in minimizing the sum of squares for error. The 
algorithm then displays the sum of squares for error 5 (z~z')® over all the observations, 
gives some indiation of the relative change in this sum by varying u, v and w, and 
finally produces a matrix MAT of observed values z in the first column, and calculated 


values x in the second column. The algorithm is shown below: 
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If only two parameters (u and v) are to be estimated, TEST 3 can be 


simplified to the algorithm TEST 2 shown below: 
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Similarly, if only one parameter (u) is to be estimed, TEST 1 as shown 


below can be used to estimate u% 
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To quickly obtain an initial rough approximation of the parameters u, v and 
w, the number of observations in M can be reduced considerably, and the number 
of rows in the matrix TE of TESTS can be reduced from 27 to 9, as illustrated in the 
algorithm EST3 below: 
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In each nonlinear model where regression was carried out through the 
origin, the second line of the invoked algorithm FN would be the same. For 


model M10, Ay = ~ay dn (y/R), and the function FN used was as follows: 


VSSE<R FN if 
Lij X<x£0-+/ (XO<ME 3:2 )xeM[ :21+R)s0 


(2) “SSh<+/ (YO-xOxR=(4/7OxXN 0) 24/2022 ie? 
Poet ey 


For model M13, Ay =cy (R-y), and line 1 of FN was 


ba XN O-4/ (KOM 32x R ae 3250270 


For model M21, y = ba°, and line 1 of FN was 
Pay XaXO] C4 / KOM Olek) 20 
For model M22, y(t) = Rie and line 1 of FN was 
Dd Oe = (ef MOS axis 2 1) SNC 
t 
For model M9, y = R(ct -c) (so that y = 0 when t = 0) and line 1 of FN was 


Pa AKO CF / KOR oa 2 1) ei ao 


For model M12, y = R(I/(1 + Ab‘) - 1/(1+A) ), (so that y = 0 when t = 0) and 


line 1 of FN was 


Pah XerO=(4/NOH (1 aR Keo a a te ae 


For model M16, A y =ky (1-(y/R) By and line 1 of FN was 
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For model M18, A y = ky™ (R-y) uh and line 1 of FN was 
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For model M15, y = R(II/(1+ Ae “1 8 - /+A)9), (so that y = 0 when t = 0) and 
line 1 of FN was 
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7 and line 1 of FN was 


For model M17, Ay =ky™ (R-y) 
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For the case of a regression line through 8, #4 0, the function FN can be ~ 


used to calculate the values z'=b) +b, f(x;u,v,w) as an estimate for z. In 
[21 Sober] (Y-2xB1<(4/2xY )e4/ 242 ) 42 


and SSE would be the sum of squares for error as obtained in ordinary linear 
regression. Here, the estimated regression coefficients b, and b, are tien by the 
normal equations if the parameters above have been estimated, and the sum of 
squares for error, SSE, can be written as 


2 


SSE= 5(Y; - by - bX) 
= p(y; - by x;)” 


with y; = Yj - Y, and x; =X; - X 
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When using a regression line through Bo # O in models M9*, M12* and M15*, 
the assumption that y = 0 when t = 0 is not necessary. Thus, for model M9*, 


rae Rend line ucn ENE 
[1] X+XO-(4+/X0«RE1J*RE2 «UL 3 23) 80 

For model M12", y = g, + R/(1+Ab!), ond line 2 of FN was 
(aa AAROM4/ KON 1 RE1 KAO eo 0 

For model MI5*, y = B, + R/(1+ Ae) ® and line 2 of FN was 
(ala VED Sieh AA IC REE Oe CO 


For some models with a regression line such that 8, was arbitrary, the first line of 
the function FN could be simplified because it was not necessary for y to be zero 


fort =0. For example, in model M22*, line 1 of FN was 
Li} X<xXO-(+/40+*RxH(32]))2H0 


This was the algorithm used for the models given in Case Il, Case Ill, Case IV, 


Case V and Case VI. In the model for Case |, FN was the same as for model M22. 


REFERENCE: 


A.D. Falkoff and K.E. Iverson, APL 360: User's Manual, International 
Business Machines Corporation, New York, N.Y., 1968. 
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APPENDIX | 


COMPUTER PROGRAM TO CALCULATE STATISTICS 
FOR REGRESSION THROUGH THE ORIGIN 


If Y is a vector of N observations of the dependent variable, and X 
is a vector of N observations of the independent variable, the program Y REG®D X 


calculates: 


le The mean of X, MX =SX +N, with SX =X. 
2. Themean or ¥, MY =SY ]NwithoyY =n y. 
3. The sums of squares of X and Y: 
SPL EPA SEE SOI 
MX2 = (X-MX/?, and MY2 = 5(Y-MY)? 


4, The sums of products: 
SPA OB PAO 
and MXY = > (X-MX)(Y-MY). 
Bs For the regression equation Y= + 8, X+¢, the following statistics: 


Estimates b, and b, of the regression coefficients R and B, , 


with B, =b, = (X-MX)(Y-MY) as an estimate of B,, and B® =b5= MY-BIxMX, 
> (X-MX)? 


as an estimate of By - 


The sum of squares for error, SSE, with SSE = D(Y-Y')?, where Y' = by +b, X. 
The sum of squares for regression SSR, with SSR = (5 XY? +D(X-MX)?. 

The total sum of squares, SST, with SST = 5 (Y-MY)?. 

The correlation coefficient, r with f = SSR + SST, 
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OF For the regression equation through the origin, Y=BX te, 


the following statistics: 


An estimate, b, of the regression coefficient e with B =b=5XY = 5 X?. 
The sum of squares for error, SE, with SE =D(Y-Y'')?, where Y'' =bX. 
The standard error of the estimate, EE, with EE® = SE + (N-1). 
The F statistic for the hypothesis that b = 0, with F = »X® = EE having | and 
N-1 degrees of freedom. 
The F statistic for the hypothesis that b =MY, and that X=1, 

ape Casa (a x2 /ny\/29p 

2 x? = 5 x(n x2/N)/¥) x SSE/N-2) 


Whee eas algorithm then prepares an Nx4 matrix REM, with the first 
column being t= 1, 2, 3, ... N, the second column the observed values 
Y, the third column the calculated values Y' = bX, and the fourth column 
the residuals (Y-Y'). 


The algorithm REG@ is shown CR pee ie 
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APPENDIX J 


COMPUTER PROGRAM TO CALCULATE STATISTICS 
OF THE RESIDUALS 


REM is defined as the following matrix: 


Column 1: = The year t= 1,2,3,...N, (N = 110 in models where data were 
used from 1859 to 1968.) 


Column 2: — The observed values of the dependent variable, z. (i.e.--The 


values for the left-hand side of the equation in any model.) 


Column 3: a) For the case of regression through the origin, the values 
estimated for z' = bx from the regression equation z = bx + e, where 
b is the regression . coefficient and e = e(t) are the error terms, 


or residuals. 


b) For the case of ordinary linear regression, the values estimated 
for za= b>, + b,x trom the regression equation z = b, + Orme. 
where b, and b, are the regression coefficients and e = e(t) are the 
error terms, or residuals. 

In the nonlinear models, the computer algorithm TEST was used to 
first estimate the values of the parameters u, v and w such that 


ye? was a minimum. 


Column 4: The error terms, or residuals, e = z-z'. 


In the case of ordinary linear regression, REM was prepared using 
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the the functions REG, INV, and RES, as described for the 
University of Alberta APL\360 system. In the case of 
regression through the origin, REM was prepared using the 


computer algorithm REG@® described in Appendix |. 


Using REM as a basis for all calculations, the main APL 360 function DOC 


calculates: 


| By means of the invoked function DSTAT: 


os 
e 


The number of residuals, N. 
The maximum residual. 
The minimum residual. 


The range of residuals. 


The mean of the residuals. 
The variance of the residuals. 
The standard deviation of the residuals. 


The mean deviation of the residuals. 


Ou COM NI One On COM INS 


The median residual. 


I The year of occurence of the residual with maximum absolute 


value. 


HI By means of the invoked function SKKU: 


1. The coefficient of skewness of the residuals, SK, an 
estimate of its standard deviation, SDS, and the t value, 


SK + SDS. 


2. The coefficient of kurtosis of the residuals, KU, an 
estimate of its standard deviation, SDK, and the t 


value, KU + SDK. 
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IV An estimate of the serial correlation of the sequence of 
residuals R(1), R(2), R(3), ..-, R(N). The sequence is 
divided into two series S1 = R(1), R(2), R(3), ..., R(N-1), 
and S2 = R(2), R(3), R(4), ..., R(N); the simple regression 


coefficient between SI and S2 is then calculated. 


V By means of the invoked function STATRES: 


12 


The sum of the N residuals. 


2. The sum of squares of the residuals. 

3. The Durbin-Watson statistic. 

4, The number of positive residuals, p. 

5. The number of negative residuals, n. 

6. The observed number of runs (m) of consecutively 
positive or consecutively negative residuals. 

7. An estimate of the mean of the population of runs, u, 
expected in a random sequence of p positive signs and 
n negative signs. 

8. An estimate of the standard deviation of the population 
Ofnuns; Oo Di= ole 

9, Thet value, i.e. - (MEAN-RUNS) = SD, as calculated 
above. 

VI An estimate of any parabolic trend in time for the residuals; in the 


equation R(t) = 2, + At+ Bf, the algorithm estimates: 
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The unknown regression coefficients ® ,B and®. 
The standard error of By and B’.- 


The t value of By and 8. 
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4. The sum of squares for regression, and the sum of 
squares for error. 
5. The mean square for regression, and the mean square 
for error. 
6. The F value for regression. 
7. The standard error of the estimate. 
8. The square of the multiple correlation coefficient. 
(The invoked fuction REG was used for the above estimates; the 
F value for regression and the t value for Oe were the main test 


statistics desired for this case.) 


The functions DOC and SKKU are displayed below. The functions 
DSTAT, STATRES, CM, REG, INV and RES are a standard part of 
the University of Alberta APL\360 system 1 
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V ANSWER<+DOC REM3R3;YM3;N;SC;PR 
Beale DSTAT R+REML 34] 
2] Y¥M<1968-(N+pR)-CJ/R)iCT//R) 
[3] "YEAR OF MAXIMUM DEVIATION '. YM 
(ey SKKU R 
ES SC<CMO(2,N-1)e(C1vR); 1¢ek 
[6] “SERIAL ‘CORREGALION oe beso 
[7] STATRES REM 
[38] PR ACLIS) REGO Ce G (ae ia Cy) 2.) sae 
[9] "PARABOLIC REGRESSION OF RESIDUALS WITH TIME';PR 
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[3] STANDARD DEVIATION "SSDS 
[9] ' T-VALUE eres DS 
[10] ‘COEFFICIENT OF KURTOSIS "SKU 
ae STANDARD DEVIATION 's SDK 
Rae {VALUE 1. KU+SDK 


me oa Tomy, 
: ne 7 par 

moupe nner olf big stein hee ai wk ‘ 
Uf 


o 


 ,otornifza ont! te wort qwehit % 

Aieistieos oottslbidéealqithar edt ‘oars oft. 
off watom es wvodp ell tol bert sow OA saltout badlowct i ‘ 
feo? nine etl arow mY a wis Porth boc miaptpes apt wuluy 7 
“Leaoo ath xe botdeel-eoPetiesta *, 
7 7 

notional of? .Weledbewe liptbeono  ONF2 eer 206 snobitnet fT 

Jo rag bupbinsie -sr0l229 Sno M4) <OML ND) BTA AT2N 7 
Vest OCA cote os ial 


vtaesens 
27 he VA ea ot SK WeRh V: 
| Los earerh Ties (tJ 
(ni \ yi )- thee eaeet (20.5 2 


wer LOLA INSG veel kan “i gala? Ir} 
Hin” . 4s hobs  S9 
Post Vitis KC ECARD aa 


ui SH WA. ant a ear, 20 eee prey aut sn 
sa ) i 


ate ag a yaspeaihy hy aE Chee et 
Sab SS eee <7 “eh tt yee Re ee 
; ¥1h- +uhe (ita FAME ar rive 

ooh) recbest RD PAG 
sas -tfexs . uty 


- A33 - 


REFERENCE: 


Te K.W. Smillie, STATPACK2: AN APL STATISTICAL PACKAGE, 
Second Edition, Publication No. 17, Department of Computing 
Science, The University of Alberta, Edmonton, February 1969, 
PprulOn 20,020,277 oer is. 


pdtv? Io toocntioge®., a Lol potioat le 
POY] yiowsdet einonhd arog 


*]2A2] BLO S44 IO JUDdIGIUBIS yy 


A A AV AV B;GdIIDA juspusedagq 

49S" B6~ #0" ZOE eel PLL x LOEL- JO118 pappunys + *q = “4 

OLL02z- L6€70"0 L-328L"€ Z1610°0- tq 4yu9191j}209 uoIssouB9y 

6081- Z°OLL Lo°9€ 76° O0- “q 44Ua191Jj909 UojsseiBoy 

L=A6L7 L 6€2° | OOZOZ1 009C¢S n ‘iajewpind @|qD1410/\ 
: ¥ Buinpyjoul ‘siojowioind auj jo sajowissy 
© #6886 °0 #%8866 "0 #8166 °O xx LE66°0 BPO MOEIS ELEO 
; 62S2 C°SC8 6L,98 IZ Te ey tea oa jracseeieh SEIS ERIS 
8478 °9 ZAOLGL 000908 008029 . saipnbs jo wins JO4119 
ZO1 90 | ZO ZO} JOIIS JO} WOpses Jo saubSq 
Oli 601 OL OLL STS VET SICEIS IE SSPE TUS) 
c2W »LZW x1 x0LW ~DUSILWLS 


CCW GNV «lZW’xeLW “x0 LW STSGOW YVSNIINON JHL WOds SLINSAY JO AYVWWNS 


"0% J HONOYHL SNIT NOISSIYOIW V ONISN AG GANIVLEO “eZLW GNV «SLW ‘s8LW 
“0DIW “aSLW “26W 72ZZ7W “xl ZW “x€LW “x0LW STSGOW YVSNIINON SHL WOdS SLINSAY 


>» XIGN2ddV 


7a aA = ~ Pe 
,*SSM "TSM ,*EIM 4! 10} YM SHT M6 

Mew esS' s+ HOG rs Sie eT ' 4S A —_ FTTRA OWA 42 
j22390 38 A OVI2U Ya G3 ATSO “NIM GviA US 


— a a = 


2]3G0M SAS4LIMOM SHT.MORT 2TIUZSR 


seta 0H 


€ bb Ove ee 
slnubtee: ent lo ene Sepia 
aaa 


Apel 
xe pie ‘fe erase it ote Sa snort” On 


* 


s «ss a = * Eake 
- i anivilor! metering eftie aston ze 
| i pone pcb 
* i 7 
; ut 7 
octet! OOats » ietsranog eldoha¥ 


Cs 
- ea ‘ 7 = 4 
f H © 1 ot ineiotttoes nodesiost 

iS oe shecail Ay he ee ee oe 
ee 
yes 


= a - 

™ Ye 2 fee ‘ P . bweh 2 
-. * - ° oa Se ‘ 4 “| al hs , 
"ld . td ao NPY ts VEN t'Ja ur wah gt S13 S00 TR : 


brobaste = 
a a 


a 


oid a t= ( ela “4 1078 


_lsvelh de 1.0 sft 40 tapaitingee * 
= rs a ~ 7 
Pa ase a 7 


A j = - = : : = —— 


"JOAS] LO SUF $P FUDSIJIUBIS xy 


K A AV AV BIGDIIDA JUapUsdeg 
OL Oats *x9Ly V *##GC0°V olZal S15 OI 02) BOs OOS 
O99 0= GSO"? vrs" t 7 CSL a0 SisOHiny JOstueloNseCD 
SIS 0- 8L8°l *##V68°V GOe*L SSOUMS eat Os ae OR Aos 
oO OPLLO= 8027 °0 8Scl 1 LO00€°0 SSOUMO>S 0 1NOl2it42O® 
: best? GC 0cl> 88° 9¢- O62 als [OP ee SVN 
VLL?e 6°0S¢ O02 Lg 06°¢S COE Cay) 
LL&S6 Le6v ZOEY, Z Sly SUOI{OIAEP JO BHuoy 
cS6 1 8961 896 | 9S61 WNWIXDU JO IDSA 
LCV CCOS EXE SUG UO LPL Kop BUN 
Oll 601 OL OLL S|PNpIsei Oded uan 
CCW *xLCW xo LW +0 LW SIIShevas 


Se ee eee ee ee  — — — 


eZZW UNV xlZW xe lLW/s0LW STJGOW YVSNIINON SHI NI STVAGISSY SHL SO SOILSILVLS 
(penuijuod) >») XIGN3ddV 


ne tae ae 1 aS 
Ualeaa 3 aye  * 


sracease Ww? « mee 
s 


ipeohes tp triefalh 
diestrus ) suid 
. oldohov sai ngs 


en 1 
— 


-_ 


- A36 - 


*#96 LL 
*x#00L°S 


**9G6°6 
LENES 
GenoG 
67 
L9 
” 


ESS 10°0 


#0CW 


vce | 
20) ais 


*#901°8 
C6S V7 
CC OV 
37 
9¢ 
A 


6£5°7 
GLV4C- 


*x#VOV°L 
vLL°Y 
ce 1S 
be 
6€ 
91 


e8sy 0 


+o LW 


* [BAZ] %L°O S44 4° JUDDILIUBIS yy 


ccOm uo1ssei6e1 40} J 
062° |  foenpa-t 
OWI} ULIM S[ONPISEs OY} JO UOIsse1BEs 91jOqG010d 
3 V98°Z D /(w-T) fanjor-4 
gig. : 0 ‘UO1}OIASP PADPUPIS 
Wee , Tl ‘upaw uolyo{ndog 
9S u ‘suBis sA1joben 
GS d ‘suBis SA1}ISOg 
G1 wi 4SUNI PAAISSGOC 


ssuBIS BAlJOHau pud sAijisod jo suns yo JEqUINN 


1£09°0 DI4SIFOYS UOS4FO/AA—ULGANG 
+0 LW SOUSIEVES 
tSst 


ee ees 


aZEW UNV ‘alZW “xSLW “xOLW 


STJGOW YVSNIINON JHL NI STVAGISSY JHL JO SSINWOGNVY SHL dO SLSSL 


(penurjuos) 4) XIGNaddV 


4#noD) 4X 
iaitno>) A, 2 
= = a 


43 Seem Hotta 


—_—s 
- .nolieteh bipbrx 

we eee 
a 


Z ’ <4 ial tty ene 2h re c2atgSt eT so) 
7 se 


some 


= 


-~ A37 - 


AV 


##V°6C1 
€G170°0 
VOMIT Gu 
OOEeclL 
VZSZ°0 


*+#9€66 °0 
Vo. GL 
GAGLo°9 


901 
OLl 


#8 LW 


AV k i 
eeOeeol 429659 Peer 
Zev L°O OOSZr 1 oor9rs 
Z08° €- 0°027- L791 
v66L°0 G1€6°0 Z186°0 
oo0ees ZyZh l-38vv'°e 
£76066 0 $40606°0 446666 0 
L6°7Z SIKNS CeCSe 
GA8v6°S PAecca | 95Z06°9 
901 901 901 
OLL OLL OLl 
«9 LW 4Z LW wW 


* 1242] %L°O SY} 40 UDSIJIUBIS x 


a|qDlIDA yuspuedaq 


4019 pappunyjs + “q = 4 


Tq “juaroijje09 Uo!sseiBoy 
°q “JU aIdIJje09 LOIsseboy 
A fjajpoupind a]qd1id A, 
a 4aayounsod a|qDLD A 
ae) Buipnjour ‘seyouibipd ayy jo SojOW!{s3 
et {UO1{D]91407 
S|DNPISOd BUY JO JO149 PAOPUDIS 
soapnbs jo wns 1013 
JOIIO 10} WOpsdi JO sooi5eq 


suo I{DALBSGO 4o 19G WAN 


DILSILVLS 


ol UE PRS hae Dr apd” Pin OR nes en Ps ee ee 


#8LW GNV #9 LW 


(penulyuos) >) XSQNAddV 


TTIW x6W SISGOW YVANIINON SHL WO SLINSSY JO AV WWNS 


IM 2 Sf) Vin*@M 2.43 wy 


. aCe 
(Behe. if _ ete. é 


L085 


eretsmoisg art Fe 2otore 
emg sidotey - 
¥ \wtenpwd oldohrely 
4 vincinttiees: velestak > ; 


ey 
a , Risse nei eS 


: nov tas! bean 


- A38 - 


AY 


888 °0 
607 °0 
60S" | 

ccéSe 0 
60S 7> 
C755 

9 0ZE 

CCG | 

3 Sol- 
Oll 


#3 LW 


AV 


0S9° | 
ZVSL°0 
—69L A 
9Z07°0 
ZLEL*O 
G9" 87 
9°y8E 
956 | 
Lio 
OLL 


#9 LW 


6662°0- 
poh pone Oe 
702200 .0- 
V6Z100°0- 
6L° VV 

8° €0€ 
6891 

896 t 

V° S96 

Olt 


#0 LW 


LL8v°0 
661¢2°0 
OVS 
CC7S50 
63.915 
£7861 
LVL 
L561 
6°99 
OLl 


2x6 W 


a|GDLIDA JuspUusdseq 


SISOJIN} 1OF ON|DA—4 
SISOLIN> JO 4UDID1}1BOD 
SSOUMAHS 1Of SN|DA—4 

SSOUMOHS JO {UB!II14J90>) 
JONpIsei UDIPS\W 
UOIJOLAEP UDds\\ 

SUOI}DIASP JO SOuvy 
WAWIXDW JO IDS} 
UOIJOIASP WNWIXDW 


SJONPISe1 JO JOGWAN 


DILSILVIS 


a es a ee 


#8 LW GNYV #9 LWsZLW/s6W SISZGOW UVSNIINON SHL NI STVAGISAY JHL SO SOILSILVLS 


(penuyuos) >} XIG NdddV 


a uh 


tha 


» a, ws oy ” ens JAUCHI2S — 
BIM GMA 4a} F f aS 4 4 +5;8 CoA 


(is tag = 7 


Vent! tte. neo 7 


ae 


reactant — Ped a 


woanwera 11 Slay 
sa 


AeA. ri¢t.0 plepiey a to tontatties : 
52 5 ago. fPabkeg : “ ai sinneaual ror pets 
; s: a 
vA . ; . sldchient nob 
; " ’ ie i as = = 
=e a | 


= A3? - 


* [BAB] %L°O B44 40 JUDDINIUBIS oy 


Cv88°0 62620 °0 *¥ZL0°6 Ley? UoIsseiBe1 104 4 
622" | LL€Z"0 #2G86"€ 670° 2- “OgoenpPA-4 
ZOWI} UJIM S]ONPISEs O44 JO UOIsseiBe1 O1jOqD4bY 
pe leGne Tages wrice6 PR OOCS O/ (wT) fenjoa-4 
v02°S 022°S Z61°S eles DeGUOTIO Ne paps UOrs 
v8°SS 9S ers conag TM fupau UOL4O|Ndog 
8g EC LS LG u /suBIs dA1joHban| 
cS GG 6S €¢ d /subys OAls1SOg 
el Zl Ve 7. we /SUNI PSAJOSGO 
ssuBIs OAIJOBaU PUD dAlJIsod Jo sUN JO JOQGUINK| 
€609°0 VEE "0 68890 °0 G8260°O OUSIDIS UOS}OAA=ULGING 
#8 LW #9 LW +o LW #0 W SOLMSTV ES 
Usa. 


we ae nn SS 


x8 LW GNV «9LW 
‘ ZIW/26W STAGOW YVANITINON HL NI STVAGISSY JHL JO SSENWOCNVAS JHL SO SEESEUE 


(penulyuos) 4 XLGNadddV 


the on + be ce a Sian 3 


A MOgY 


ao] A 20h va bof Z A | t rf. . Pe 
mig SM BMOOMIAS . 


“—_ 2. aft 


wz evitegen bros 


2 - 


2 ea vortelpadt 7 a 
enitel yet o bast z- 

a = _ os am 

. r= yh yoleoyet : 


\ — a r - 
uislaer ‘ant to pstioniest oe 


= 


7 


*JBA2] HL" SYs 40 FUSIHNUBIS x 


AY K B|GDIIDA JUspUsdsg 
49° ZEL #800 L 40149 pappuojs + “q = 4 
PaO 7S «| 009067 “g ‘4ua1911j909 UolsseiBoy 
Se Gio Vogt °g 44U9191}}909 UojsseiBoy 
(winuixDw) 6666 °O Viace M ‘JopowpsIOd B|GDIID A 
000672 650Z0°0 A 719{9WUDIOd @|GDLID A 
© 7228 "0 ZL0S 0 n 449j@WDIDd @{qO1I0 A 
< u, Buipnjour ‘suoyowdsod ay; Jo sopowlisy 
#%6066°0 #6666 °0 gl 7UO1}D]S1407 
Gy*SZ ©°0SZ S|DNP!ISOI SU} JO JCI PADPUDIS 
O0OZZ6S EL) seupnbs jo wns 10143 
COL COL JOIIS JO} WOPseEd} Jo soo1b9eq 
OLL OLL SUOLJDAJASGO JO JEQUNN 
#ZLW #G LW SIUSITVES 


eee 


eZIW ONV xSLW SISGOW YVINIINON SHL WO¥d SLINSIY JO AYVWWWNS 
(penuijuod) > XIGNAddV 


iaea! Sete a 
se 1 2 B.1 “¥ 


stovbtes eft to tone by obe ne > 


cas = \reitslene. 


mtenmod et to esto iat _ 


u ,elsmepa sidoieV 


v later shdctan s 


w vetemmag-elds rio 
ae 
an 


slatiieon rio Eee ig 
one were 


- Al - 


AV 


LOvAL 
£O0Vv9°0 
iseys |! 
LOVV"O 
C6 yey = 
Cie 
6° 69e 
edd 
[Sst 
OLl 


#LLW 


9278 °0 
LSse°o 
VL9-? 
¥Z09"0 
Uae oe 
v° O61 
SLL 
O06 | 
L°7S9 
OLl 


*G LW 


a|GD1uDA JUspUsdegq 


SISOLIN} JOF BN|DA—} 
SISOLIN} JO JUSIOIJ{9O> 
SSOUMOHS JOF SN[DA=4 

SSOUMODS LO JUDIOIJ1E0D 
JONpise1 UDIpsyy 
uolJDIAap UDEV 

SUOIIDIADP Jo eHupy 
WNWIXDW JO IDS) 
UOIJOIASP WAWIXDW/ 


S]ONpPIsed pte) JOQUNN 


SES [UverS 


ESS 


*ZLW ONY « SLW SISGOW YVANIINON SHL NI STIVAGISAY FHL 4O SDILSILV LS 


(penulyuod) >) XIGN3ddV 


~ atelnaa eet 
notified’ ae 


maisivel pet 
= 
lotkizei webaM 
azenwsilt trsisitisod 
By jeans as? silva a 
wo! to es? ‘SHhagD. ~~ 
uel ve aylowet es 


ajdigiraw in cre Syres 400 
: - 4; 


- AA? - 


CS81°0 
ZVLS 0 


##ZV8°L 
VOCS 
Vo SS 
8S 
cS 
Sl 


Les 0 


#Z LW 


* [P02] %1°O Ss IO JUDIIZIUBIS x» 


G78" uoissoiBai 10} 4 
06Z i= “9d JO ONjOA~3 
SOW} UJIM S[ONPIsSed SYy JO UOIsseiBE1 D1 jOqnID4Y 
*%C00°6 . O/ (wT) fanjor—4 
SES D UOIJOIANpP psDpUDrIS 
€6°SS Tl ‘upsw uoljpjndog 
LS u ‘subis sAljoBany 
eS d /suBis dAljisog 
6 wi /suNI PaAJasSqO 


ssuBIS BA1ZDH9au pud sAljIsod jo sUNI JO JEQUINN| 


06560°0 O14S1JOJs UOS}O\\—U GING 
«SLW OILSILVLS 
Sal 


ee SS SS SS 


#ZLW GNV #SLW STSGOW YVSNITINON SHEL NI STVAGISIY SHL JO SSINWOGNV4 FHL SO SLS31 


(penuijuos) 4 XIGNaddV 


a (om ap ans 


ioral? pRter eloubieas att *o — 


ave 


nN 


COZ 


*%6696 °0 


74860° t 
659L6°€ 
GALCY*V 
pe oN S 


Se 
Ge 


*J2A2] BLO SUF 4 JUDSIIUBIS xx 


QIGDIIDA JUepusdeq 


Qe aout 
stsoujoddAy aus sop 4 JO ON|DA 


gl ASONJDA P2LOjNDINS 
UD peAlasqo UsEeMIJoq UOLIN]OLI0 
4 GO Poupieie 


SJOWIISA YL 10 JOLIE PANPUDIS 
souonbs Jo Wns 1019 
q 7JUuaLo1Jye09 UOIsselHOy 
9 JJOPAWNINd S/GDLID A, 
ssJopoudind aul JO SoJOWIISA 
JOLIO OJ WOPSEJ JO sooiH9q 


SUOILOAIOSGO 40 JSQUNN 


SLES TEV LS 


SETS 


1 SVD YO4 TAUGOW SHL WOM SLINSAY JO AYWWWNOS 


IA aSVD GNV ‘A 3SVD ‘AL aSVD “HL SSVD “1 aSv> “1 ISVD NOI STIGOW YANIINON 3HL WOdd SLINSAS 


1 XIGNaAddV¥ 


- 7 i i t as 7 

32A5 It! Eze} ee angen 

< ia = - 7 ss a _ ==" 
oe | 32> 507 330) MM SHT Mos? 217JU2389 30 ¥ 


> == | 7 ae emcieG. dt © sisal = 

oe < t~3taf 2 3 . LH EMPIOG > sidsncY iy 
aacth.e d \snolswow aateemgut 

E . ; : ; = ‘fate o- . es iaupe ic a d 

oe) ae = ctamites off Toucms mbna 


ban bays cdo. Ca vated ro BelsneD - 
=*00480_5 stsviocy. es 


- AA4 - 


b 


eles 


GASSO* C- 
GAZL9" k 

93661- I~ 
L-AV.C0¢ 


+xLL66 “0 


BIEL 

bee Jefe) S) 
CE 

ce 


dS To 


A 
09 L6= 
C56 16° €- 
GASSL*E 
DAV VO. L- 
8-47L0°V 


*%5966 ° 0 


cOEL 
LAaycy’S 
Ce 


nN 


#%90° 8L- 


GA076° S- 
GALL6"€ 

€46¢5,.2- 
[EAS 


+%666 0 


0°66S 
94905 ¥. 
Al 

Gt 


Al dSVO 


nN 


#%09° S9= 


CAGLS.8- 
G1088° © 

Oso lel 
P=AQyL 9 


+%VS66°0 


LZLe 
83602* | 
75 

0c 


lets vo 


*1OAS] HlL°O SUF JO JUNSILUHIS x» 


nN 


x2€° SV L~ 


CACLO S= 
SI0C6°E 

OAcC lal 
L=a01V 9 


+*5866 °0 


SIGDLIDA JUepuedeq 
Joule pappupjs + tq= 4Y 
To fJusporsje0o uolsseiboy 
Og 4yuatot11e809 UOIsse1Soy 
Q ‘aapowpsnd s]qDIID A 
> /jJOJOWNIOd SIGDID A 
ssloLOWDIOd Sul JO SopOWIISZ 


Z 
oi SON|DA PSLDINDIHD 


pup PpeAissgo UseMLOOG UOSLD JOIOF) 


Ecve 
83628° L 
Ce 

GE 


IP asVv 


IA dSVD ONY A SSVD “AL SSVD “IN SSVD “I dSVD YOs STISGOW JHL WOU SLINSSY JO AYVWWWNS 


(panutyuos) 7} XIGNaddV 


syOnpisel oul 50 IOIIS PADPUDIS 


sospnbs Jo wins JO115 
JOO JOJ WOPsads Jo soe16Eq 


SUOIIDAIOSGO JO JEqUAN) 


DILSILV LS 


2A NAL Bea? IB ames 
: eal ——s ae > Vi oe nian tod 7 


= ae 


ee er a ee | 
ca vee — ee 7 — 


ser - — Oe Nie Eas elanbizas alate 


ai iy YS 
5 ; bro baviaude print ee! nol loteD 
7 #09 6 HOO. SHOVES 0 *PKEOR DO: = EOE. 3 yzeulby — 
: : 


io - fist ihe) WG 2; inte mom 9. 


1 _ 


i - 
1 


- i 7 Tt ; hee 3 — =m oe ee 
Yaato.¢ 5-3PN0. 3 woh TEE VaR 6 -5Gi3.8 9 \lehanertatl sides | Dy 
2 ; J 7 _ 
« —_ . r eT e by r == * o ad S\ , 
“$3921. /- 6324), i- eveCE.. 4 Stal yt SgesT.t- 6 isiioipe Sect 

7 : i 7 
2207s. | rer ee 2a05G £ 23050.€ PIGse . C 4 tinainitises aietnenay x 
ae An0fO ¢£ aA” =e rS 2, ASSOC os a ee elect iti =_ 
eset). S- cs ; ret" Law? « Ladakh’ Wea Va 8 We U™ £ tA 455) i yews — 
' f = : ; : . 

oF 31 t- 82, 12— om he aie =oe Car “FE Bat = Taiis bisbrote:+ rt 


~ b= 
re X u u us sldenov —— 7 
~_ 7 7 oe 
=) 2 @ 
= 
- ro 
io 


“he ) i sae 7 pie. oT 
ol - love) of 1.0 til i) ni - 


a 7 


i . ia > 6 


Ase 


86S1°O- 
evel Os 
vz0° lL 
€ZOV 0 
6° LEC 


S572 


INISYD A SSVD CAL aSV> ‘Ill 3SVD “Il aSVO 71 aASVD dOd STAGOW SL NI STIVNAISAS 


2£06*0- 
950L°0- 
L8S° 1 
£829°0 
OO) ee 


Coals 
CS8e*0- 
V6LS°0 
ELOE*O 
ZELOV 
0 

006 | 
£56 | 
ovol 

0° rer 
1 


INS S62) 


OL78°0- 
97.68 0- 
0025°0 
SGT 
9 60 7= 
0 

Orcol 
9€6 | 
0L9S 
Cvve 

02 


It 3SVD 


€S08°0 
STEED 
ySce"0 
V67L°O 
aC om 
0 
OLZOL 
VE6l 
Ofys- 
OT 1 
Ge 


at ®) 


v6S* | 
Ove* | 
SVG. c- 
Cv68°0- 
6°899 
GURA 
O&cZLy 
VEL 
OZcLe- 
Cvs 
Ge 


leS vo 


(penulyuod) } XIGNaddV 


SISOLINY JOL ONJDA4 
SISOLINY JO JUDIOIJJSOF 
SSOUMOHS Of ONJDA=4 

SSOUMOMS JO LUSIOIIIEOF 
JONpIses UDIPSy 
sJONpIsel Jo UNSW 
SUOIJDIAEP JO EHuny 
WNWIXPW JO IDO) 
UOIJOLASp WWAWIXDW 
UOLIDIAEP UDS\\ 


SJONpISol jo JOqunn 


DILSILVIS 


FHL AO SOILSILV ES 


ae 
enottaivel %: ere 


 sleubiiea -ansM _ 


* ~ * ome =, 
loybirsar oe iboi 
7 


= 


~ § BSI 0 eCt0- - GEAE.O- ais, A $6 Ste Cet. © sear? = tasiaftiens 


~ ere ‘sear. eope, oe ‘gises0- 5268.0 epi toned i alot 
on = i es a 4 = = 


EN Ges 


¢1080°0 


GCLL0 0- 


#98S © 
268°C 
Le Od 
a1 
él 


WN asVO 


cL8V°0 
Gyo. 0 


*%LG6°V 


€S8°C 
vI°sl 


OOEL°O 


MESS) 


€SS7°0 
GIL> On 


8682°0 
£58° 1 
L97°8 
Z 

8 

vs 


6L0 a1 


NaS VD 


IA aSVD GNV A aSVO 
TAL aSVD ‘Ill ASVD “Il SSWD 71 SSVD YO SIZGOW JHL NI STVAG ISd¥ FHL AO SSINWOGCNV8 SHI SO Sisal 


€vzy00°O 
VV7S0°0 


989°C 
G80°2 
ye(9)| 
Al 

8 

c 


c60* | 


Hl aSVO 


L8LL°O 


v6Ccv"°0- 


*G68°C 
268°C 
(5S, 1) 
él 
Ds 
OL 


Ov0* L 


Welt) 


eajqooijddo jou ysoy °peu 
*J2A2] % LO SYs 42 HUBSIJIUBIS ax 
*J2A2] %] OY} HD FUDOIJIUBIS 


etre uoisseiBal 10} 4 


ya Bd 10 on[DA~ 
x#89° ZI ue JOA=4 


SOU} ULIM SJONPISAL BUY JO UOIsseiSe1 D1;OqD1bg 


wa lO°S 0 /(u = TI) tenors 
ZAG Se OD fUOIJOIASP PIOPUDIS 
6y°st 1 Funaw uo1pyndog 
ZA u ‘subis sAlpoBany 
8 d /suBis SAILISOg 
€ w ‘sun! poAlssacC 


ssuBIs BALyOHeU pud aAIsIsod Jo suns Jo JEqwaN 


epeu ONISIIOJS UOSIDAA=UIGING 
1 4dSVO DILSILWLS 
Sat 


(penuryuos) ] XIGNaddV 


= 6 
cata a .7 


Fy! ul 132A 1% 


 yigaaD 


af tie 


WS 


(bse 


ae 


writin) 1X13 a ae 


ee 


+ OMAR 
eo fae 


il 3cAD 


* gO, 1 


Det 
engi evogan bas 
Of 


th 


La] 
y aoc sohtetiaget a 


foltsived biplnote * 


4 
«= \ 


GN if nm. w “ wlloge 
inubizer sc# Jo neiesibet Stloduyst — 
rx 
st 2 sulin 


*JOA2] FLO S44 HO FUOSIUOIS x 


A @{GO1IDA sUspusdapU| 

AV a1qDiIDA juspUusdeg 

9LVE0"O 0 = %q yous sysaujodAdy ous 40} 4 JO ONIPA 

#0986 °0 = Isan|DA pejD|Nd|D9 PUD padiesqo USEMJEq LOLP]S+40) 

x L901 SIDWIJSO OY} JO JO1IS PAOpUd} 

x 006785 soipnbs Jo wins 1049 
| 6091L°O gq ‘pUSIDIJ1809 uolsseiboy 
yZ91°O d ‘sdyawoind ajgniin A 
O06ESE y ‘yapowoind seqolio A 

ssloJOWOINd SU} Jo SeJOWIISA 

cS JO1I8 JO} WOPse.} Jo seo1H9q 

GS SUOIYOAISSGO JO JOEGUINN| 

SITS IS 


SLINSSY JO AWWW 
i ee ee 


8I6L OL VI61L WONA VLVG ONISN “9LW TAGOW YVINIINON SHL WOU SLINSSS 
W XIGNadddV 


St Go a. 


T0238. a0 


hg aT y = ; : 
{sts smog rote _— 


NOr.0 v ¢ sterense slain 


- - : . - i * -- « — eg _ 
ott ins ; aa! - Co cS , ss s00 fo a bal = 
ry <i — oS . . 
a ootage rereupe Jo. ie fen 
oT i somites ort 3e rene tape 
7 aa ; > a ; . 
a. = Oe 5? .0 1 steultv betalosloo ban bovierds seated noiteteneo - 


df hEO.0 => gt seerhogul sak 1t 4 daa 


. vA _  ahdnitey testemge, - = 
: 1 OS 
ze . el, latte v tes a 


x ie Fe 


o* = -seal a 


love) 3?! .0 sr ap tneolt male * 


af INE 


BOE 
82S I> 
7eVS °0 
6cve°0 
LC Ge 
ZO" II 
6° €0€ 
€26 | 
Cae! 
Ge* V6 
LE 


(2961 - S161) 
SuvaA ado 


v6v8"0- VOL" In ona 


1602 70= PALS SISOJIND JO FUDIII}}OO™ 
LOLA) €099° 0 oNn|pA=4 
Cyl 0 7Z|Z°0 SSOUMOHS JO }UD191 4190 
CO O0C- WG bbe JONpIses UDIPEY 
C3, on 97" | S|ONPISEl JO UDS\/\ 
€ E66 © C6e SUOLJOIASP Jo SHudy 
CES | Axe} WANWIXDW JO IDS) 
Lele 1 SN UOIJOIASP WAWIX_Yy 
V9° 28 82°88 UOlJOIASpP UDSY 
82 GG S|DNPISOd JO JOGUINK\ 

(8961 - 7161) (3961 - 7161) DILSILVLS 
SUVSA NSAG SUVSA T1Y 


896] OL vI161 WOU VLVG ONISNA 
ES 


SIW TZGOW YVSNIINON SHL NI STVAGISAY FHL SO SDISILVES 


(penuiyuod) W XIGNdAddV 


eee corre =" re 
8M 800M 3A3/1101 3HT 71 20h 1235 BHT °F 


a 


- RRABY WANS : “20A3Y JA — _ o_— 


{Sa¥ i Le diel) ‘earl - aide aaa - 


no: tnivak prea 7 
a 


A Gat : a Sl risoint io zy 
: ay 7 —- 
t.ott fe BRE znoltplved 16 span 


a a ° ‘ ~ i = - 4 _ 
=H FEC 8+ ws zloubizer Fo nos 
TALS ss Bo #)S- vs, S&- _ loublest dol ‘een 
et. bit. 0 -O1S..2 2e6nware te visas 


FC oa ep cieotwuil Ye: trina . 
im 7 << s 


- A49 - 


€v0S0°0 
LLZ700°0O 


Coe | 
WS"? 
Srv 
vl 
cl 
Ol 


(2961-161) 
SYVJA GdO 


95660 °0 
LSvv 0 


806° L 
€8S °C 
conv 
Gl 
el 
Ol 


(8961-7161) 
SUVIA NSAI 


sajqooijddo you si jso} OY *D°U 
*JOA2] % {70 SY} HO FUPIISIUBIS xx 


ZOL90 °0 uoisseiSa1 JO} 4 
€ZOE "0 , JO ON|DA=4 
SOW} YLIM S[ONPISes OU} jfO UoIsselHa1 DijOQRIDY 
x%C8V V7 o/ (w-7) ‘enor 
GIDAE D ‘uoljOIAep psppuds 
CV 82S Tl 4upaw uoljojndog 
5c u ‘suBis sAlJOHaNy 
9¢ d ‘suis SAIfISOg 
Cl wi ‘suns peAlesqO 


ssuBis aAlyoHeu puD DAIJISOd JO SUN JO JOQUINK| 


“p°uU DIYSIYOJS UOSFDAA—UIGING 


(8961 -VIé61L) 
CNVa ily. SIESILV IS ass 


nT 


8961 OL 7161 WOYA VIVG ONISN 


eae ee sae Se ee ee oir 


“OIW TSGOW UVSNIINON JHL NI STVACISAY SHL SO SSUNWOGNV/8 SHI 3O SLSSL 


(penuijuos) Ww) XIGNaddV 


ino) MS Az a. ; 
tesur ae Ki alla q9A 


. Cr 
tM 13001 4 4a NOK 3 sf 3H 30-2234 th ioaisa a 
is <= = | 
a - a si 5 
mean | “Bady sh i et 
= 
> 
Pen 
1 


ey 7 Bari =afel) (S081 ~ ath ST —" 


——— — 


- 
Ls aE aS on a 


— 


sian : - , : = era 


e455 7 fa . 


> 
<r aTtegee 


-sidosily ton 
s a 


; *|2A9] % 170 OYE 4D HUOSIJIUBIS x 


({-4) A B|QDIIDA juspusedepu} 

G)AV B|qDLIDA ;Uepusdeg 

8S9€0 "0 0 = °q wus sisoysoddy sy JO} 4 40 ON|PA 

EO CR6 LO RiP e Qin pei2|i2(re pUD paAlasgo UBSEMjoeq UO]}D|S110> 

Zant SIOWI ISO 9UL JO JOLIS PINPUDIS 

2 OOZLE9 soupnbs JO WAS JOIZ 
< 902¢°0 gq ‘JUB191}j909 Uo!sse1Boy 
00Z1°0 d ‘sayounind e|qoLiDA 
000zZSE Yy {4eyoUIDID SO}qDLID 

ssdOjowoIOd OU JO SoJOWI4S 

ES JOII@ 10} WOPdEl} JO soaibeq 

GS SUOIJOAIASGO JO JEGUIAK| 


SUSI Vis 


SLINSSY JO AYVWWWNS 


Se 


8961 OL E161 WOUd VIVA ONISN “S9LW TECOW YVINIINON SHL WOdd SLINSSY 


N XIGNdddV 


. 
— as eh I > alles = _ @ 


fact o> eee 2eT.JU234 4O VRAIS a 


——. <i os re 
mann = 


Sea 


G istemetne ole 
ide 8 


4anae ares 7 és 
& , ris Di SOD Tr aece 


‘ Pelee _.« . _ « BBInge tae 7 
—_ 
ad Saat f -Simert hes hae rons beh boi 
a : a : oe a 3 ; 2h ‘Moers Dein bei bole nsswehsd stale : > - 
: . ; : Tete 


eh sHtegyris it “at 42 ‘3 auto’ f 


ry ace 
: 7 . Abie, indie 
; : sided sabres 
= 
, 
- 
_ - 
- 
= — co 7 
7 aod a _s he 
-lovalat 0 silt a eesitingig =~ 
: . i i 
ae = a 


: - —— 


-ASl - 


Sez |e 
as 
89L5°0 
€8Sc°0 
8¢ SC- 
Z8° Il 
v°1ce 
e726 | 
7°68 
LZ°86 
LC 


(7961 - SI6l) 
SYVJA ddO 


cCV8 O- 
66CL 0= 
GéEE°0 
967 1°0 
65° Ic- 
OL/a Sm 
v° 607 
CCS | 
G*L0c- 
I8°S8 
8c 


(3961 - vIé6l) 
SUVaA N3AS 


Viale 
collate 
G299 0 
6V12°0 
82° Sc- 
8Se" | 
y° 60V 
C€6 | 
CeZ0Cs 
6£° 16 
GS 


(3961 - 7161) 
SUVA TIV 


SISOLIN JOL ONDA} 
SISOLIN JO {UDIIILJSOD 
SSOUMOHS JOF ON|DA-}4 
SSOUMAHS JO JUSIII{OOD 
JONpises UDIPSOYy 
SJONPISO’ JO UDSY 
SUOIJOIASP JO aBudy 
wNWIXOW Jo IDO) 
UdIJOIASP WAWIXDYW 
UOIJOIAep UDS\y . 
SJONPISel JO JEQUINK| 


SIESHIVES 


ee nn ee ee es ee ee ee ie 
SOIW 1TZGOW UVSNIINON JHE NI STVACISSS FHL 4O SDILSILVLS: 


(penuiyuos) N XIGNdddV 


Lo _ = : é. —— = 
-2RAaY wave 25AIY IIA 


Gal. ~ 3191) i -~aRp- — 7 


(Bee 


Sev} Te eee ei 4 
za : ae —— 
Bie +, <0 OE #n alinivel tomgenh 7 


zlov' set o-nBalt 


_ 
: ‘levbiney AaPkSih = © 
mac; or ee ne oor sie i «- bane =e 
_ en «|: axa! mee viNer? ke Inetisitteg / 
: ' = ~~ ei > 


° saie.¢ ae. loAe zatwe-= 1 soled tA 


nig! te trelolitseae: : 


. , = 7 
elacius se? Sule? = oe, a 


009V0°0 
995 L00°9- 


LES | 
VS" C 
sy°vl 
al 


eo 


-p°u 


(296L-S161) 
ova, ddo 


01260°0 
G9EV *O- 


806° | 
CBS" c 
co: VL 
Gl 
fl 
Ol 


“peu 


(8961-7161) 
SUVZA N3AG 


SOLW TIGOW YVSNIINO 


8ZZ50°0 
£062°0 


#xC87 7 
GOs 
CV SC 
62 
9¢ 
All 


S}ayO {0 


(8961-7161) 
SUVIA TIV 


N SHL NI STVNGISIY SHL JO SSINWOGNVY SHL JO SLSSL 


*}2AQ] % [70 SU} 42 FUOSLJLUBIS x 


uoisseibo1 10} 4 
EJ 36 ON|DA=4 
VV 

TOW} UJIM S[ONPISEs Ot 4O uoisselGes D1;OGD1Dd 

o/ (w=) fenj[DA-4 
OD ‘uolpolasp pappudts 
7 “upow UuolyD|Ndog 

u ‘suBis dAlpobeN 

d ‘suBis SAI}fISOg 

w “suns PSAIESGOE 


zsuBis BAlyOHou pud SAIFISOd JO SUNS fO JOQUIAN 


D14SiJOJs UOSYOMA—ULGING 


BS) PES SESMES 
igs 


ae 


(panuiyuod) N XIGNdddV 


| IASON 3HT M41 2aAuolesn ait 30 2330 
ols 5 — os — — 7 
SHAR 3 \ 
nies . mel -sion 


— 

— a an, 
a 7 

a =i 


ae 
ee - ; _ : NE : evitonen baw el tag 
OT | 


mii ,Eaie avitogel 
4 Tesh sobialucst 


| sa i 
= a nas alow -t 


louistess ot FO ner Sips sled 


ange meas Sane 3 or eet Pe ne geet Se eeerme ee ee et = as oe leas. oe 8 Ra = : ae are == : Sane 


